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Kenneth  L.  Clarkson2,  Herbert  E dels b runner3,  Leonidas  J.  Guibas4, 
Micha  Sharir5  and  Emo  VVeLzl6 


Abstract 

We  present  upper  and  lower  bounds  for  extremal  problems  defined  for  arrangements  of  lines, 
circles,  spheres  and  alike.  For  example,  we  prove  that  the  maximum  number  of  edges  bounding  m 
cells  in  an  arrangement  of  n  lines  is  0(m2/37i-/3  +  n)T  and  that  it  is  0(m:/3n:/3;3(r»)-*-n)  for  n  unit- 
circles,  where  3(n)  (and  later  J(m,  n))  is  a  function  that  depends  on  the  inverse  of  Ackermann's 
function  and  grows  extremely  slowly.  If  we  replace  unit-circles  by  circles  of  arbitrary  radii  the 
upper  bound  goes  up  to  0(m3/sn4^5/3(n)  +  n).  The  same  bounds  (without  the  ,3(n)-terms)  hold 
for  the  maximum  sum  of  degrees  of  m  vertices.  In  the  case  of  vertex  degrees  in  arrangements 
of  lines  and  of  unit-circles  our  bounds  match  previous  results,  but  our  proofs  are  considerably 
simpler  than  the  previous  ones.  The  maximum  sum  of  degrees  of  m  vertices  in  an  arrangement  of 
n  spheres  in  three  dimensions  is  0(m4/  n9/7/3(m,  n)  — rc:),  in  general,  and  0(m3/,4n3,43(m,  n)  —  n) 
if  no  three  spheres  intersect  in  a  common  circle.  The  latter  bound  implies  that  the  maximum 
number  of  unit-distances  among  m  points  in  three  dimensions  is  0(m3;2f3(m))  which  improves 
the  best  previous  upper  bound  on  this  problem.  Applications  of  our  results  to  other  distance 
problems  are  also  given. 

Keywords.  Discrete  geometry,  extremal  problems,  bipartite  graphs,  arrangements  of  curves 
and  surfaces,  triangulations,  probabilistic  counting,  zones,  Davenport-Schinzel  sequences, 
counting  distances. 
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1      Introduction 

Combinatorial  distance  problems  for  finite  point  sets  have  a  long  history  in  the  mathematical  liter- 
ature (see  '48,  50]).  Some  of  this  literature  was  originated  by  the  following  question  asked  by  Paul 
Erdos  in  1946   24]  (see  aiso  [25]): 

"What  is  the  maximum  number  of  pairs  in  a  set  of  m  points  in  two  (or  three)  dimensions 
that  are  exactly  at  distance  1  from  each  other?" 

By  scaling  we  can  make  any  distance  the  unit-distance,  so  this  question  is  equivalent  to  asking  how 
often  the  most  popular  distance  can  occur.  This  seemingly  innocent  problem  turned  out  to  be  one  of 
the  most  difficult  problems  in  combinatorial  geometry,  and  it  is  still  far  from  being  solved  —  in  spite 
of  some  recent  progress  in  its  analysis  to  which  this  paper  significantly  contributes.  In  particular,  we 
give  a  new  and  simpler  proof  of  the  0(m4^3)  upper  bound  in  the  plane  [54]  and  improve  the  upper 
bound  in  three  dimensions  from  0(ms/s)  '10]  to  0(m3/2J(rn)),  where  3(m)  is  an  extremely  slowly 
growing  function.  The  two  upper  bounds  are  corollaries  of  more  general  results  about  arrangements 
of  curves  in  two  dimensions  and  surfaces  in  three  dimensions  obtained  in  this  paper.  We  change 
gears  now  and  introduce  these  more  general  concepts  and  talk  about  the  philosophy  that  motivates 
and  directs  our  work. 

Let  us  consider  a  collection  H  =  {/ii,  /12, . . .,  hn\  of  n  bounded  or  unbounded  (d  -  l)-dimensional 
manifolds  in  the  Euclidean  d- dimensional  space.  The  arrangement  A(H)  is  the  subdivision  of  the 
space  denned  by  these  manifolds.  Intuitively,  we  think  of  space  as  being  filled  with  some  solid 
material;  the  space  is  then  "cut"'  along  each  of  the  manifolds  in  H,  resulting  in  a  collection  of 
connected  pieces,  called  cells  or  d-faces.  The  formal  notion  of  an  arrangement  includes  not  only 
these  "'top-lever  faces,  but  also  the  lower-dimensional  ones  that  form  the  boundaries  of  these  d- 
dimensional  faces,  as  well  as  the  incidence  relationships  between  all  these  faces  of  various  dimensions. 
Arrangements  of  lines  and  planes  have  been  extensively  considered  in  the  literature;  see  [34  or  15' 
for  further  details.  For  additional  material  on  the  data  structures  needed  to  capture  the  topological 
information  present  in  an  arrangement,  see  also  [35,  14] . 

As  an  example,  if  the  hi  are  straight  lines  in  the  Euclidean  plane,  then  the  arrangement  of  these 
lines  is  a  decomposition  of  the  plane  into  open  convex  regions  (the  2-faces  or  cells),  relatively  open 
line  segments  bounding  these  convex  regions  (the  l-faces  or  edges),  and  the  intersection  points  of 
the  lines  /i,  (the  0-faces  or  vertices).  Every  point  of  the  plane  belongs  to  exactly  one  face  of  the 
arrangement  A(H)  when  these  additional  lower-dimensional  faces  are  considered. 

Arrangements  are  ubiquitous  structures  in  geometric  computing:  many  geometric  objects  of 
interest  can  naturally  be  specified  as  a  collection  of  cells  or  lower-dimensional  faces  in  an  appropriate 
arrangement.  For  instance,  the  convex  hull  of  n  points  in  the  plane  corresponds  to  the  cell  containing 
the  origin  in  the  arrangement  of  the  lines  dual  to  the  points  [15].  The  Voronoi  diagram  of  n  points  in 
the  plane  can  be  viewed  as  the  planar  projection  of  a  cell's  boundary  complex  in  a  three-dimensional 
arrangement  of  planes  [49.  15]  or  cones  [31].  In  general,  semi-algebraic  sets  (that  is,  geometric 
sets  specified  by  unions  and  intersections  of  half-spaces  bounded  by  algebraic  surfaces)  naturally 
correspond  to  collections  of  faces  in  the  arrangement  of  the  underlying  surfaces.  In  fact,  a  whole 
branch  of  solid  modeling  has  arisen,  constructive  solid  geometry  [46],  based  on  solid  shape  descriptions 
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in  this  boolean- tree  fashion. 

Although  arrangements  are  commonly  occurring  objects,  they  axe  expensive  to  manipulate  and 
to  store  in  their  entirety.  An  arrangement  of  hyperpianes  in  Ed  takes  space  0(nJ)  [15],  so  even  an 
arrangement  of  lines  in  E2  is  an  unwieldy  object.  Of  course,  if  our  computation  requires  us  to  visit 
every  cell  of  the  arrangement,  then  we  must  pay  at  least  this  Q(nd)  cost  in  time.  In  some  situations 
L6],  however,  we  can  sweep  over  the  arrangement  by  a  topological  wavefront  while  performing  the 
computation,  thus  reducing  the  working  storage  required  by  our  algorithm  to  that  of  storing  only 
the  cells  intersected  by  the  wavefront.  plus  some  auxiliary  structures.  We  omit  further  details  here, 
because  in  this  paper  we  want  to  focus  on  situations  where  not  all  but  only  some  of  the  cells  are  of 
interest.  We  will  further  assume  that  the  "interesting"  cells  are  given  to  us  not  implicitly,  as  in  the 
boolean-tree  paradigm,  but  instead  by  explicitly  indicating  a  point  contained  in  each  cell  we  must 
consider. 

In  this  and  several  companion  papers  [19.  20)  we  deal  with  the  combinatorial  problem  of  estimating 
the  worst-case  combinatorial  size  of  m  cells  in  an  arrangement  of  n  manifolds,  and  the  algorithmic 
problem  of  computing  (a  boundary  representation)  of  those  cells  —  for  various  dimensions,  and  for 
various  types  of  manifolds.  We  have  a  collection  of  powerful  tools  which  in  various  combinations 
can  be  used  to  attack  each  specific  problem.  Section  2  will  spell  out  the  specific  results  contained 
in  this  paper.  For  now  we  just  wish  to  remark  that  our  algorithmic  and  combinatorial  tools  are 
closely  intertwined.  We  design  efficient  algorithms  because  we  have  access  to  detailed  combinatorial 
knowledge  about  the  underlying  geometric  objects.  At  the  same  time,  we  axe  able  to  prove  purely 
combinatorial  results  by  using  algorithms  as  proof  tools.  We  believe  that  this  work  fits  well  within  the 
paradigm  of  the  analysis  of  algorithms  initiated  by  Knuth  [41,  42,  43).  We  should  remark,  however, 
that  in  contrast  to  [L9.  20)  this  paper  is  predominantly  combinatorial  and  refrains  from  discussing 
algorithmic  issues  that  do  not  also  serve  combinatorial  ends.  In  another  companion  paper  17  some 
of  the  algorithmic  problems  suggested  by  the  investigations  of  this  paper  are  studied. 

The  many-faces  problem.  In  the  many-faces  problem  our  input  consists  of  m  points  and  n 
manifolds.  We  wish  to  count  or  construct  the  faces  bounding  the  cells  containing  these  points. 
K(m.  n)  is  used  to  denote  the  maximum  number  of  faces  of  the  m  cells  in  a  generic  sense  where  the 
context  defines  the  type  of  the  n  manifolds. 

The  techniques  for  studying  this  problem,  both  here  and  in  r19,  20',  can  be  viewed  as  divide- 
and-conquer  attacks  on  this  problem.  In  the  approach  taken  in  this  paper,  which  we  call  the  primal 
approach,  the  points  are  partitioned  into  groups  according  to  an  underlying  subdivision  of  space 
into  cells,  to  be  specified  shortly.  We  will  call  this  subdivision  the  funneling  subdivision  and  its  cells 
funnels,  to  distinguish  them  from  the  cells  of  the  arrangement  that  we  are  trying  to  compute.  The 
funnels  define  the  subproblems  for  the  divide-and-conquer  attack:  the  subproblem  of  a  funnel  /  is 
the  manv-faces  problem  for  the  points  lying  in  /  and  the  manifolds  intersecting  /.  Notice  that  this 
means  that  each  point  is  passed  to  only  one  subproblem.  while  a  manifold  may  be  passed  to  many 
subproblems  (all  those  corresponding  to  funnels  it  intersects).  In  fact,  a  key  issue  for  the  success 
of  our  method  is  the  design  of  a  strategy  for  choosing  the  funneling  subdivision  so  that  the  average 
funnel  is  not  cut  by  too  many  manifolds. 

The  technique  we  use  for  accomplishing  this  involves  probabilistic  methods:  we  choose  a  random 
sample  R  of  our  given  manifolds  and  •'triangulate*'  the  cells  of  the  arrangement  A{  R).  The  resulting 
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decomposition  is  the  funneling  subdivision.  The  size  r  of  the  random  sample  R  is  a  parameter  that 
we  use  to  optimize  the  effectivity  of  the  method.  Specifically,  this  is  achieved  by  arguing  that  in 
an  average  funnel  the  number  of  points  that  lie  in  it  is  substantially  smaller  than  the  number  of 
manifolds  that  cross  it.  In  such  cases  we  invoke  a  different  kind  or  argument,  specially  tailored 
for  each  specific  type  of  manifold,  to  obtain  a  bound  on  K(m,n).  We  call  these  bounds  Canham 
thresholds.  after  Raymond  J.  Canham  who  was  the  first  to  prove  such  a  bound  for  the  case  of  lines 
[6j. 

The  "triangulation'  involves  breaking  each  cell  of  the  arrangement  A(R)  into  pieces  such  that 
each  of  them  has  constant  descriptive  complexity.  We  need  this  refining  of  the  cells  in  order  to  be 
able  to  apply  to  funnels  arguments  related  to  the  £-net  theory  [37,  11,  12] .  The  triangulation  method 
is  specific  to  the  manifolds  that  we  have  in  each  case.  For  instance,  in  the  case  of  lines,  we  can  draw 
a  vertical  line  segment  through  each  vertex  and  thus  cut  the  cells  right  above  and  right  below  the 
vertex.  This  results  in  a  decomposition  of  each  cell  into  trapezoids  which  have  finite  description  since 
they  have  at  most  four  sides  each. 

Unfortunately  there  is  one  more  difficulty  to  overcome.  It  can  happen  that  the  cell  surrounding 
a  point  p  in  a  subproblem  need  not  be  the  same  as  the  cell  surrounding  p  in  the  full  problem.  The 
two  can  be  different  if  the  cell  surrounding  p  in  the  full  problem  "spills  out"1  of  the  funnel  in  which 
p  happens  to  fall.  But  then  all  these  problematic  cells  are  cells  intersected  by  the  triangulation 
manifolds,  the  manifolds  introduced  to  cut  down  to  constant  size  the  complexity  of  the  funnels 
forming  the  funneling  subdivision.  All  the  cells  intersecting  a  specific  manifold  define  the  zone  of 
that  manifold,  and  good  combinatorial  bounds  for  the  complexity  of  zones  are  already  known  for 
manv  interesting  cases.  Thus  we  are  able  to  deal  with  these  difficult  cells  by  a  special  argument. 

In  the  approach  taken  in  [19,  20],  which  we  call  the  dual  approach,  we  essentially  use  a  funneling 
subdivision  in  the  dual  space.  Our  manifolds  are  partitioned  into  "bundles",  and  a  point  is  distributed 
to  all  the  bundles  that  it  splits,  where  points  not  splitting  a  bundle  lie  on  the  same  side  of  all  manifolds 
of  the  bundle.  In  this  approach  a  new  difficulty  arises:  since  a  point  may  be  distributed  to  many 
subproblems.  we  need  efficient  ways  to  merge  the  cells  for  that  point  returned  by  each  of  the  relevant 
subproblems.  We  refer  the  reader  to  [19,  20]  for  details. 

The  incidence  problem.  The  incidence  problem  is  similar  to  the  many-faces  problem  in  that  its 
input  consists  of  m  points  and  n  manifolds.  An  incidence  is  defined  as  a  point  and  manifold  pair 
so  that  the  point  lies  on  the  manifold.  The  problem  is  to  count  or  bound  the  number  of  incidences 
that  are  possible  for  certain  classes  of  manifolds.  We  use  I{m,  n)  generically  to  denote  the  maximum 
number  of  such  incidences. 

The  approach  to  proving  bounds  on  I(m,  n)  is  essentially  the  same  as  to  the  many-faces  problem. 
so  we  just  note  the  differences  that  arise  simply  because  the  problems  are  different.  Overall,  the 
incidence  problem  is  easier  since  we  do  not  have  to  worry  about  cells  of  the  arrangement  that  cross 
boundaries  in  the  funneling  subdivision.  On  the  other  hand,  the  nature  of  the  incidence  problem 
is  such  that  we  cannot  make  arbitrary  general  position  assumptions  —  in  fact,  configurations  that 
maximize  I(m.n)  will  most  likely  be  highly  degenerate.  For  instance,  we  cannot  assume  that  the 
points  do  not  lie  on  any  of  the  manifolds  and.  in  particular,  on  any  of  the  manifolds  that  are  used  to 
construct  the  funneling  subdivision.  This  creates  the  need  for  upper  bounds  on  the  maximum  number 
of  incidences  between  the  points  and  the  manifolds  of  the  funneling  subdivision.    This  problem  is 
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easier  than  the  original  one  since  the  number  of  sampled  manifolds  is  much  smaller  than  n.  In  some 
cases  such  bounds  are  available  as  "dual"1  counterparts  of  Canham  thresholds,  in  other  cases  more 
elaborate  schemes  have  to  be  used  to  estimate  the  number  of  incidences  contributed  by  sampled 
manifolds. 

The  results  of  this  paper  will  be  summarized  in  the  next  section.  There  we  also  give  pointers  to 
the  literature  and  to  the  places  in  this  paper  where  the  results  can  be  found.  Roughly,  the  structure 
of  this  paper  is  as  follows.  In  Section  3  we  describe  the  details  of  our  method  for  the  case  of  the 
many-faces  problem  in  line  arrangements.  This  problem  has  a  relatively  easy  proof  and  is  used  to 
describe  the  general  proof  method  without  too  much  interference  by  technical  difficulties  that  arise 
for  particular  types  of  manifolds.  Section  4  gives  a  general  technique  for  proving  Canham  thresholds. 
The  major  tools  in  that  section  are  an  extremal  theorem  for  bipartite  graphs  and  various  elementarv 
geometry  and  topology  lemmas.  All  two-dimensional  many-faces  and  incidence  results,  including 
applications  to  two-dimensional  distance  problems,  are  described  in  Section  5.  In  Section  6.  we 
derive  results  on  the  incidence  problem  for  points  and  spheres  in  three  dimensions  and  on  related 
distance  problems  in  space.  Finally,  we  discuss  the  contributions  of  this  paper  and  mention  a  few 
problems  that  remain  open  in  Section  7.  Throughout  the  paper  we  ignore  constants  and  concentrate 
on  asymptotic  results  when  we  state  and  prove  theorems.  We  will,  however,  remark  on  the  constants 
that  follow  from  our  methods  whenever  this  seems  advisable. 


2      Summary  of  Results 

The  results  of  this  paper  are  in  three  categories:  bounds  for  many-faces  problems,  bounds  for  in- 
cidence problems,  and  bounds  for  combinatorial  distance  problems.  We  summarize  the  results  oi 
each  category  using  tables  which  give  the  bounds  and  provide  pointers  to  places  in  this  paper  where 
the  final  results  are  obtained.  Some  of  these  bounds  are  given  in  terms  of  a  generic  function  3(n) 
or  j(m,n)  which  changes  from  case  to  case.  However,  in  each  case  it  is  a  function  that  depends 
on  the  inverse  of  Ackermann's  function  and  grows  extremely  slowly.  In  addition,  we  briefly  discuss 
each  bound  and  relate  it  with  earlier  results  in  the  literature.  Beyond  the  three  categories,  we  also 
list  here  a  few  results  of  this  paper  which  are  needed  to  obtain  the  combinatorial  bounds  and  are  of 
independent  interest. 

Many-Faces  Bounds.  The  general  theme  in  this  category  is  to  derive  upper  and  lower  bounds  on 
A'(m.n),  the  maximum  number  of  edges  bounding  m  distinct  cells  in  an  arrangement  of  n  curves 
of  a  certain  kind.  We  have  results  for  lines,  pseudolines.  unit-circles,  circles,  and  pseudocodes  i  see 
Table  2.1)  \ 

The  upper  bounds  for  lines  and  pseudolines  improve  previous  bounds  given  in  6.  23  :  the  matching 
lower  bound  can  be  found  in   231  (see  also  [15]). 

The  lower  bound  for  lines  extends  to  unit-circles  (and  therefore  also  to  circles  and  pseudocircles)  since 
they  can  be  blown  up  so  as  to  approximate  any  pattern  of  Lines  arbitrarily  close  within  a  bounded 
region.  This  implies  that  our  upper  bound  for  unit-circles  is  off  by  at  most  a  factor  of  j(  n)  from  the 
best  known  lower  bound  —  this  factor  is  probably  an  artifact  of  the  proof  technique  used.  The  most 


For  technical  reasons  we  assume  that  a  pseudoline  i  pseudocode  I  intersects  any  vertical  line  m  one  point  i  at  most 
two  points). 
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K(m,  n) 

section(s)  ] 

lines 

Q{m2'3n2/3  +  n) 

3  and  5.6 

pseudolines 

Q{m2'3n2'3  +  n) 

5.6 

unit-circles 

0(m2'3n2'33(n)  +  a) 

5.6 

circles 

0(m3/5n^5J(n)  *  n) 

5.6 

pseudocodes 

0(m3/5n4/5,j(n)  +  ra) 

5.6 

Table  2.1:  Summary  of  bounds  for  many-faces  problems. 

difficult  part  of  the  upper  bound  proof  for  unit-circles  is  the  Canham  threshold  (Section  4.5)  which 
states  that  any  m  cells  in  an  arrangement  of  n  unit-circles  have  at  most  0(mn'l':  +  n)  edges.  The 
techniques  used  to  establish  this  result  are  of  independent  interest. 

The  upper  bounds  for  circles  and  pseudocodes  are  also  new  and  follow  readily  from  the  general 
techniques  of  this  paper.  No  matching  lower  bounds  are  known.  In  fact,  the  currently  best  lower 
bound  is  the  same  as  for  unit-circles  except  for  a  small  range  of  values  of  m  where  a  slightly  better 
bound  can  be  shown  (see  remark  (2)  after  Corollary  5.6). 

Incidence  Bounds.  As  in  the  introduction,  we  use  I(m,n)  generically  for  the  maximum  number 
of  incidences  between  m  points  and  n  curves  or  surfaces  in  two  or  three  dimensions.  Recall  that 
we  have  an  incidence  if  a  point  lies  on  a  curve  or  surface.  In  two  dimensions  we  have  results  for 
lines,  pseudolines,  unit-circles,  circles,  and  pseudocodes.  In  three  dimensions  we  consider  spheres 
and  distinguish  two  cases:  (i)  no  three  spheres  intersect  in  a  common  circle  (referred  to  as  spheres 
in  "general  position"),  and  (ii)  the  points  are  restricted  to  vertices  of  the  sphere  arrangement.  The 
bounds  are  listed  in  Table  2.2. 


1 

I(m,  n)                      j  section(s)  j 

lines 

0(m2''3n:/3  +  m  +  n)          \  3  and  5.3  ' 

pseudolines 

0(m2/3;i2/3  4-rn  +  n)                 5.3 

unit-circles 

0(m2'3n2'3  -m-n)                 5.3 

circles 

0(m3/5n4/5  -f-m  +  n) 

5.3 

pseudocircles 

0(m3/5n4/5  4-m-i-n) 

5.3 

spheres  in  gen.  pos. 

0(m3/4n3/4/3(m,n)  +  m  +  n)            6.4 

spheres  and  vertices 

0(m4/7nd/7(3(m.  n)  +  n2)              6.4 

Table  2.2:  Summary  of  bounds  for  incidence  problems. 


The  bound  for  lines  is  not  new  and  goes  back  to  [56];  however,  our  proof  is  much  simpler  than 
theirs.  Furthermore,  the  constant  of  proportionality  in  our  analysis  is  only  3y6  (see  Section  3) 
whereas  [56l  establishes  the  upper  bound  with  a  constant  equal  to  10°°.  Our  proof  also  extends  to 
pseudolines  for  which  case  our  upper  bound  appears  to  be  new. 

The  upper  bound  for  unit-circles  is  also  not  new  and  was  derived  earlier  in  [54].   Again,  our  proof 
is  simpler  and  the  constant  that  follows  from  our  proof  is  much  smaller  than  theirs.  It  seems  Likely 
that  the  upper  bound  is  not  tight;  the  best  lower  bound  known  for  the  case  m  =  n  is  n1^'     °s10sn, 
for  some  constant  c  (see  Erdos   24i). 
For  circles  (and  pseudocircles)  our  upper  bound  improves  the  Q(m3/An3/*  +■  m  -  n)  bound  of  5.  10  . 
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Again,  no  matching  lower  bound  is  known,  although  fi( m2/3n2/3  -  m  -  n)  can  be  derived  from  the 
lower  bound  for  lines  (see  remark  (3)  after  Theorem  5.4)  and  a  slight  improvement  of  this  lower 
bound  for  a  small  range  of  values  of  m  follows  from  a  result  by  Erdos  '24'  (see  remark  (2)  after 
Corollary  5.6). 

Our  bounds  for  spheres  in  general  position  improves  the  0(m4/5n4/s  +  m1  2n  ~  m)  bound  of  Chung 
'lOi.  A  lower  bound  of  fl(m4/3  log  log  m)  can  be  obtained  in  the  case  m  =  n  from  the  currently 
best  lower  bound  for  the  unit-distance  problem  in  three  dimensions  discussed  below.  The  most 
difficult  step  in  the  proof  of  the  upper  bound  is  the  triangulation  of  an  arrangement  of  spheres  into  a 
number  of  funnels  which  is  only  slightly  supercubic  in  the  number  of  spheres  (see  Section  6.3).  This 
decomposition  (Theorem  6.6)  is  of  independent  interest. 

For  the  case  of  spheres  where  the  points  must  be  vertices  of  the  arrangement  our  bound  is  the  first 
known  non-trivial  upper  bound.  No  matching  lower  bound  is  known  if  m  >  en0  4.  for  some  constant 
c.  To  prove  the  upper  bound  for  spheres  and  vertices  we  derive  an  extension  of  an  extremum  result 
for  bipartite  graphs  with  certain  complete  subgraphs  prohibited  (see  Lemma  6.2):  this  result  is  of 
independent  interest. 

Distance  Problems.  Using  the  bounds  for  incidence  problems  summarized  in  Table  2.2  we  derive 
new  bounds  for  a  variety  of  combinatorial  distance  problems  in  two  and  three  dimensions.  We  first 
list  our  results  on  repeated  distances  (see  Table  2.3);  m  denotes  the  number  of  points. 


bound 

section 

unit-distance  in  the  plane 

Ofm4'3) 

5.4 

unit-distance  on  a  sphere 

0(m4'3) 

5.4 

unit-distance  in  space 

1  0( 

m3'23(m 

)) 

6.5 

bichromatic  maximum  distance 

Q{m) 

6.5 

bichromatic  minimum  distance 

I  0( 

m3  23{m 

)) 

6.5 

Table  2.3:  Summary  of  results  on  repeated  distances. 

The  upper  bound  on  unit-distances  in  the  plane  is  not  new  and  dates  back  to  54  —  however, 
their  constant  of  proportionality  is  much  larger  than  the  one  that  follows  from  our  proof. 
Our  upper  bound  proof  for  unit-distances  in  the  plane  carries  over  to  points  on  a  sphere  in  three 
dimensions  (see  remark  (4)  after  Theorem  5.4)  in  which  case  a  matching  lower  bound  can  be  con- 
structed from  an  arrangement  of  m/2  lines  and  mi  2  points  with  fi(m4'3)  incidences  (see  [29!).  It  is 
interesting  that  the  lower  bound  is  such  that  the  unit-distance  is  exactly  one  fourth  of  the  length  of 
a  great-circle.  For  other  distances  the  currently  best  lower  bound  is  fifmlog*  m)  29  . 
The  upper  bound  for  unit -distances  in  three  dimensions  improves  the  best  previous  bound  which  was 
0{miis)  1 10].  We  consider  this  the  most  important  new  result  on  counting  distances  derived  in  this 
paper.  The  currently  best  lower  bound  for  this  problem  is  fi(m4/3  log  log  m)  25;. 
In  the  bichromatic  case  we  have  a  total  of  m  red  and  blue  points  in  three-dimensional  space  and  we 
consider  only  distances  between  points  of  different  color.  The  linear  upper  bound  on  the  number  of 
times  the  maximum  bichromatic  distance  can  occur  appears  to  be  new  although  the  proof  is  only 
a  minor  extension  of  the  proofs  of  the  same  asymptotic  bound  for  the  number  of  diameters  in  the 
monochromatic  case  (see  33,  38.  55i).  For  the  minimum  bichromatic  distance  our  upper  bound  is 
new;  however,  no  superlinear  lower  bound  is  known. 
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A  problem  related  to  repeated  distances  is  that  of  the  maximum  number  of  furthest  neighbor  pairs 
in  a  set  of  m  points  in  three  dimensions.  If  no  three  points  are  collinear  we  show  that  0(m3'  J(m)) 
is  an  upper  bound  which  improves  the  0(m8/5)  bound  of  [lOj  (see  Section  6.5).  No  superiinear  lower 
bound  is  known. 

Finally,  we  have  some  new  results  on  different  distances.  Let  P  =  {pi,P2, .  .  ■•Pm}  be  a  set  of 
points  either  in  two  or  in  three  dimensions.  For  1  <  i  <  m  define  g;  as  the  number  of  different 
distances  from  p,.  g(P)  =  £™  t  gu  and  g(m)  =  minipi=m{g(P)}.  Our  results  are  listed  in  Table  2.4. 


bound 

section 

g(m)  in  the  plane                          |]  f2(m'/4) 

5.4 

g(m)  in  space  (no  collinearity) 

n(SS) 

6.5 

Table  2.4:  Summary  of  results  on  different  distances. 

The  lower  bound  in  two  dimensions  improves  the  f2(m5,/3)  bound  which  follows  from  the  bounds 
in  '5,  LOj  on  the  number  of  incidences  between  points  and  circles  in  the  plane. 

In  three  dimensions  our  lower  bound  is  established  only  for  the  case  where  no  three  points  are 
collinear:  it  improves  the  fi(m3/2)  bound  which  follows  from  the  incidence  bound  on  points  and 
spheres  in  [101. 

3      Arrangements  of  Lines  —  an  Example 

In  this  section  we  show  that  the  maximum  number  of  edges,  A'(m.  n),  bounding  m  cells  in  an 
arrangement  of  a  lines  is  0(m2/3n2''3  -  n).  This  improves  the  known  upper  bounds  0(m2  -n)  6  and 
0(mn1'2  -f  n)  and  0(m1/2a  +  m)  [23].  The  proof  illustrates  the  use  of  the  general  proof  components 
listed  in  the  introduction.  Indeed,  this  section  serves  as  an  introduction  to  our  techniques  before 
they  are  discussed  in  greater  generality  and  different  contexts  in  Sections  4  through  6.  We  also 
extend  the  proof  to  the  related  incidence  problem  and  show  that  I{m,  n)  =  0(m2/3n2/  -  m  -  n). 
Another  purpose  of  this  section  is  to  demonstrate  that  our  techniques  yield  a  proof  for  this  result 
which  is  much  simpler  than  the  that  in  [56].  In  addition,  we  get  the  extra  benefit  that  the  constant  of 
proportionality  that  we  get  is  reasonably  small  whereas  the  constant  in  [56]  is  astronomically  large. 

Let  L  =  {li,lz,-..,ln}  be  the  given  collection  of  lines,  and  suppose  that  the  m  desired  cells  are 
designated  by  a  collection  of  points  P  =  {pi,P2,  •  • -,Pm},  where  each  pi  lies  in  a  unique  cell.  The 
proof  proceeds  through  the  following  six  steps. 

(1)  Establishing  a  Canham  Threshold.  Canham's  upper  bound  of  0(m2  -  n)  on  K(m.n)  can 
readily  be  transformed  (by  breaking  the  given  cells  into  groups  of  n1/2  cells  each)  to  K(m,n)  = 
0(mn1/2  4-  n)  which  is  the  form  that  we  will  use.  An  alternative  proof  of  the  latter  bound  can  be 
derived  from  a  well-known  extremal  theorem  on  graphs  (see  [44,  26,  51]).  We  restate  this  theorem 
as  Lemma  4.1  in  a  form  that  is  convenient  and  allows  us  to  immediately  derive  many  similar  bounds 
for  curves  other  than  lines  and  for  spheres  in  three  dimensions. 
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(2)  Sampling  and  Triangulating.  We  choose  a  subset  R  Z  L  of  size  r.  where  r  is  an  integer  to 
be  specified  later.  Let  A(R)  be  the  arrangement  defined  by  R  and  suppose  that  each  cell  is  further 
decomposed  into  trapezoids  by  drawing  a  maximal  vertical  (relatively  open)  line  segment  through 
each  vertex  of  A{  R)  so  that  the  only  intersection  of  the  line  segment  with  the  lines  in  R  is  this  vertex 
(see  Figure  3.1).   This  results  in  a  collection  of  k  =  0(r2)  (open)  trapezoids3,  the  aforementioned 


Figure  3.1:  Each  cell  is  decomposed  into  trapezoids. 

funnels.  For  each  funnel  or  trapezoid  A,,  1  <  i  <  k,  define  P,  =  P  "  A,  and  let  L,  be  the  subset  of 
lines  that  have  a  non-empty  intersection  with  At.  Set  m,  =  P,  and  a,  =  Lx  and  note  that  3Z T=  i  m> 
is  equal  to  m,  the  number  or  points9. 


(3)  Probabilistic  Counting.  Using  the  method  of  probabilistic  counting  we  can  show  that  there 
exists  a  subset  R  C  L  of  size  r  so  that  3Ii=i  m^nl  "  -  0(  rn(n/r)l<2).  The  idea  of  the  proof  of  this 
claim  is  that  if  R  is  chosen  at  random,  then  the  expected  value  of  £),_i  mtn:  "  is  0(m(n  r)1  2). 
It  follows  that  there  exists  a  sample  with  this  property  (in  fact,  most  samples  have  this  property). 
To  compute  the  expected  value  of  the  above  random  variable,  we  observe  that  the  sum  is  the  same 
as  ^;=i  q,  ",  where  q}  is  the  number  of  lines  intersecting  the  funnel  that  contains  p.  £  P.  Clearly. 
<7;  =  n,  if  pj  g  A,.  Since  the  expectation  is  additive,  even  if  events  are  dependent,  we  can  concentrate 
on  showing  that  the  expected  value  of  q      ,  denoted  by  E[q:    ],  is  0((n/ r)l/:).   E[q  '  ]  is  bounded 

from  above  by  Er,qj}1^2  since  E[qj\  -  E[q  j2  is  the  variance  of  q  ' '  and  therefore  always  non-negative: 
so  it  suffices  to  show  that  E[qj]  =  0(n/r). 

The  crucial  step  in  proving  E[qj\  —  0(n/r)  is  the  decomposition  of  the  cells  into  regions  of 
constant  size,  such  as  trapezoids  which  are  determined  by  four  or  fewer  lines  (see  Figure  3.2  for  the 
various  ways  how  four  or  fewer  Lines  can  define  a  trapezoid).  Such  a  trapezoid  A  has  the  property 
that  it  is  in  the  triangulation  of  -4(P)  if  and  only  if  A  is  in  the  triangulation  of  .41  R')  for  some 
R'  C  R,  R"  <  4,  and  A  does  not  intersect  any  Line  in  R  -  R'.  Based  on  this  observation  the 
following  intuitively  plausible  statement  can  be  made  concrete:  the  fact  that  A  meets  few  Lines  of 
the  random  sample  means  that  it  meets  few  Lines  of  L.  The  rest  of  the  argument  is  technical  and 
the  details  can  be  found  in  Section  5.2. 


*A  ti^ht  upper  bound  on  k  is  3(J)  —  r  *•  1  since  every  one  of  the  at  most  (J)  vertices  gives  rise  to  two  cell  splits  and 
the  initial  number  of  cells  is  at  most  (')  -  r  —  1. 

'Equality  holds  because  we  can  assume  that  the  points  in  P  are  in  general  position  so  that  none  of  them  lies  on  the 
boundary  of  a  funnel. 
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Figure  3.2:  Every  trapezoid  is  defined  by  four  or  fewer  lines. 

(4)  Divide- and- Conquer.  Using  the  triangulation  of  the  sample  arrangement  A(R)  we  solve  a 
subproblem  in  each  funnel.  Specifically,  within  A,  we  consider  the  n,  lines  of  Lx  and  count  the  edges 
of  -4(1,)  bounding  the  m,  cells  marked  by  the  points  in  P,.  The  division  of  the  original  problem  into 
k  =  0(r2)  subproblems.  however,  is  not  as  straightforward  as  one  might  hope.  Take,  for  example,  a 
point  p  £  P,  and  let  c,(p)  be  the  cell  in  A(Lt),  not  necessarily  restricted  to  A,,  that  contains  p.  If 
cAp)  is  contained  in  A,  then  ct(p)  =  c(p),  the  cell  in  the  full  arrangement  A(L)  that  contains  p.  Ln 
this  case  the  edges  ofc(p)  are  all  accounted  for  in  the  subproblem  for  A,.  But  what  if  c,(p)  intersects 
the  boundary  of  A,?  In  this  case  we  have  ct(p)  n  A,  =  c(p)  ~i  A,.  The  edges  of  c(p)  that  are  or 
contain  edges  of  c,(p)  n  A,  are  accounted  for,  but  it  is  possible  that  c{p)  has  also  other  edges  that 
lie  fully  outside  A,.  For  example,  there  can  be  a  funnel  A;,  j  ^  i,  and  a  cell  cj  in  A(L3)  so  that 
c:  n  A_,  =  c(p)  r  Ar  The  difficulty  with  this  case  comes  from  the  fact  that  A;  does  not  contain  p  and 
Cj  is  therefore  not  marked  as  a  cell  whose  edges  must  be  counted.  An  important  observation  which 
facilitates  a  way  to  overcome  this  difficulty  is  that  c2  intersects  the  boundary  of  A;  since  c{p)  Z  Cj 
and  c(p)  is  not  contained  in  Ar  Thus,  c;  belongs  to  what  we  call  the  "inner  zone*'  of  Ay  (more 
about  this  concept  in  the  next  step  of  the  proof).  To  be  on  the  safe  side  we  count  the  edges  (within 
Aj)  of  all  inner  zone  cells  for  all  1  <  j  <  k. 

(5)  Complexity  of  Zones.  The  inner  zone  of  A;  in  A(Lj)  is  denned  as  the  collection  of  c  ■"  A;  for 
all  cells  c  in  A(L})  that  intersect  the  boundary  of  A;  (see  Figure  3.3).  The  combinatorial  complexity 


Figure  3.3:  The  inner  zone  of  a  funnel, 
of  the  inner  zone  of  A;  is  defined  as  the  total  number  of  edges  bounding  its  cells10. 


'Not  that  it  matters  much  but  an  edge  is  counted  twice  if  it  bounds  two  cells  of  the  inner  zone. 
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The  above  discussion  implies  that  the  total  number  of  edges  bounding  the  m  cells  in  A(L)  is  thus 

,  1/2 

bounded  from  above  by  3Z T=  i  Ofm.r^  *  +  n,-)  (using  the  Canham  threshold  for  the  cells  contained 
in  their  funnels)  plus  the  combinatorial  complexity  of  all  inner  zones,  one  per  funnel.  In  the  case  of 
line  arrangements,  the  complexity  of  a  zone  is  linear  in  the  number  of  lines  (see  '8,  21  or  15  )11. 
Therefore,  the  complexity  of  the  inner  zone  of  A;  is  0(  n; )  which  implies  that  all  inner  zones  together 
have  combined  combinatorial  complexity  3Z?=i  0(n:). 

This  and  the  preceding  sum  of  Canham  bounds  make  it  necessary  to  bound  the  sum  of  the  n; 
taken  over  all  funnels  A;.  Of  course.  £j=i  »j  =  Yl?si  Ui  where  /,  is  the  number  of  funnels  intersected 
by  line  t,1-.  Since  i,  intersects  the  lines  of  R  in  at  most  p  points  it  intersects  at  most  p  -t-  1  cells  of 
.4(/?|.  By  the  zone  result,  invoked  again,  the  total  number  of  edges  bounding  those  cells  is  0(r). 
Since  a  cell  with  t  edges  is  decomposed  into  at  most  t  funnels,  this  implies  that  (t  intersects  O(r) 
funnels  of  A{R).  Therefore  3I;=i  nj  =  O(rn). 

(6)  Wrapping  It  Up.  By  what  we  have  said  so  far.  assuming  R  is  a  sample  which  satisfies  the 
property  discussed  in  (3), 

K(m,n)  =  YO(mtnl/2  -  nt)  =  0(m(-)        -  rn). 
,  =  i  p 

We  choose  r  so  as  to  balance  the  two  terms  of  the  above  bound,  namely  r  =  0(m2  3rc~1  3).  Note 
that  this  is  meaningful  only  if  m  is  larger  than  n1/:;  fortunate  for  us.  Canham's  bound  implies 
A'(m,  n)  =  O(rc)  in  the  other  case.  Putting  the  two  cases  together  we  thus  obtain  the  asserted  bound 

A'(m,  n)  =  0(m:'V'3  -  n). 

Remark.  The  upper  bound  on  A'(m,  n)  whose  proof  is  sketched  above  is  tight.  The  lower  bound 
example  is  based  on  arranging  the  points  in  a  square  grid  and  choosing  the  lines  close  to  highly 
populated  (not  necessarily  horizontal  or  vertical)  rows  of  points  (see  [15]  for  details). 

Computing  the  Constant  of  the  Upper  Bound.  The  above  proof  is  simple  enough  so  that  we 
can  compute  a  reasonable  constant  of  proportionality  without  adding  too  much  complication  to  the 
proof.  We  do  this  by  computing  in  turn  the  constants  in  the  Canham  threshold,  the  probabilistic 
counting  result,  and  the  complexity  of  zones.  For  this  we  use  results  that  are  derived  later  in  this 
paper,  specifically  in  Sections  4.1  and  5.2.  The  reader  may  choose  to  skip  the  rest  of  this  section  on 
first  reading  and  return  to  it  after  these  results  are  presented. 

Any  two  cells  in  a  line  arrangement  can  have  at  most  four  lines  contributing  edges  to  both  cells. 
Using  remark  (1)  after  the  bipartite  graph  lemma  (Lemma  4.1)  we  can  set  3  =  2  and  t  =  5  and  thus 

1/2 

get  2m, nl  4-  n,  as  an  upper  bound  on  the  number  of  edges  bounding  m,  cells  in  an  arrangement 
of  a,  lines.  When  we  take  the  sum  of  the  first  term  over  all  funnels  A,  we  can  use  the  probabilistic 
counting  result  (to  be  established  in  Section  5.2)  which  says  that  there  exists  a  sample  R  Z  L  of  size 
p  such  that  £A|  m,^  "  <  2m(  jrrj)1'2.  To  get  a  good  constant  in  the  bound  for  the  sum  of  the  n,  is 


[n  these  sources  a  zone  is  defined  as  the  collection  of  cells  in  a  line  arrangement  that  intersect  another  line.    It  is 
plain  that  the  maximum  number  of  edges  bounding  the  inner  lone  of  a  trapezoid  A,   (not  counting  the  edges  that  lie 
on  sides  of  A,  I  is  at  most  four  times  as  large  as  the  maximum  number  of  edges  bounding  a  i  standard)  zone  in  .41  L,  ). 
l2Lf  I,  i  R  then  /,   =  0  since  we  treat  funnels  as  open  sets. 
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more  difficult;  it  is  treated  implicitly  when  we  get  a  bound  on  the  number  of  edges  contributed  bv 
the  zones  of  the  funnels. 

First  we  need  to  observe  that  the  inner  zone  of  a  half-plane  (a  degenerate  trapezoid)  in  an 
arrangement  of  n,  lines  has  at  most  4n,  -  2  edges  that  do  not  also  lie  on  the  bounding  line  of  the 
half-plane  (see  [15]).  Here  and  later  we  count  an  edge  twice  (each  side  once)  if  the  cells  on  both  sides 
belong  to  the  inner  zone.  Second,  notice  that  a  cell  of  A(L)  that  lies  in  more  than  one  funnel  must 
intersect  at  least  one  vertical  side  of  each  funnel  it  meets.  This  is  because  the  top  and  bottom  sides 
of  a  funnel  are  part  of  two  original  lines.  Thus,  we  count  at  most  3n,  -  4  inner  zone  edges  for  A,.  If 
such  a  counted  edge  meets  a  vertical  side  of  A,  (and  there  are  at  most  4rc,  -  4  edges  which  do  not13) 
then  this  edge  is  double  counted  because  it  reaches  into  the  neighboring  funnel  where  it  is  part  of 
the  inner  zone  again.  This  shows  that  the  contribution  of  all  inner  zones  is  at  most 

A,  A,  A, 

To  get  a  bound  on  this  sum  we  use  £A.  n,  =  £?._i  lt,  whe're  /,  is  the  number  of  funnels  intersected 
by  line  I,.  We  now  show  /,  <  or  -  5.  Line  £,  intersects  at  most  r  -  1  cells  of  the  sample  arrangement. 
and  by  the  zone  theorem  (see  [15])  theses  cells  have  a  total  of  at  most  6r  -6  vertices  (where  we  count 
a  vertex  once  for  each  such  cell  to  which  it  belongs).  The  two  endpoints  of  each  edge  of  A(R)  that 
intersects  I,  are  counted  at  least  twice  which  shows  that  there  are  at  most  4r  -  6  distinct  vertices. 
Each  of  these  vertices  is  an  endpoint  of  a  vertical  funnel  side  that  potentially  intersects  lt.  Adding 
to  this  the  at  most  r  intersections  with  lines  in  R  gives  at  most  5r  -  6  intersections  with  sides  of 
funnels  and  therefore  /,  <  or  -  5. 

We  now  put  these  partial  results  together  and  choose  r  to  minimize  the  resulting  constant.  We 
have 

v^            xl"1                                       I     n     n1/2      v^                   /     n     \1/2 
A'(m.  n)  <  >    (2rriini      —  rii  +  6n,  -  4)  <  4m( J        +  ^  7n,  <  4ra( J        +■  35rn  -  35n. 

At  A, 

If  we  choose  r  =  cm2/3n~1/3  +  c\  with  c  =  (^)2^3  and  4  <  c'  <  5  so  that  r  is  an  integer,  we  get 
Kim,  n)  <  4-m2/3n2/3  +■  35cm2/V/3  +  35(c'  -  l)n  <  3\/U0m2/3n2/3  +  140n, 

£1/2 

where  3^140,  the  constant  factor  of  the  leading  term,  is  approximately  15.58.  The  above  choice  of  r 
is  appropriate  as  long  as  m  >  yn1^.  In  the  other  case  the  Canham  threshold  implies  K{m.  n)  <  36a 
which  shows  that  the  above  bound  is  correct  without  any  restriction  on  the  number  of  cells. 

Extension  to  Incidence  Problem  for  Lines.  It  is  rather  straightforward  to  prove  2l{m,n)  < 
K(m,2n)  if  m  <  («):  start  with  a  configuration  that  maximizes  I(m,n),  replace  each  point  by  a 
sufficiently  small  disk,  each  with  the  same  radius,  and  replace  each  line  by  two  parallel  lines  that  are 
tangent  to  all  disks  corresponding  to  points  on  the  original  line  (see  [23j).  This  inequality  implies 
I(m,n)  <  3v/T0^2/3n2/3  -  140n.  We  will  improve  the  constants  of  this  bound  below. 


l3We  arrive  at  the  4n,  -  4  as  follows.  First  assume  that  all  n,  lines  intersect  both  vertical  sides  of  the  trapezoid. 
In  this  case,  there  are  2n,  edges  meeting  each  side;  thus  at  most  4n,  —  4  edges  remain.  Without  this  assumption  the 
number  of  edges  that  do  not  meet  a  vertical  side  can  only  be  smaller  since  we  could  stretch  the  trapezoid  vertically  so 
as  to  lengthen  its  vertical  sides. 


Combinatorial  Complexity  Bounds  for  Arrangements  12 

We  follow  the  analysis  given  above  for  the  many-faces  problem  but  modify  it  in  two  significant 
ways.  The  first  change  addresses  the  fact  that  we  can  no  longer  assume  that  the  lines  are  in  general 
position.  More  specifically,  we  have  to  permit  parallel  and  concurrent  lines  but  we  may  still  assume 
that  no  Line  is  vertical  and  that  no  point  is  vertically  aligned  with  another  vertex  of  the  arrangement, 
since  we  can  rotate  the  configuration  of  Lines  and  points  without  changing  their  incidences.  The 
second  assumption  turns  out  to  be  very  convenient  since  it  implies  that  no  point  lies  on  a  vertical 
side  of  a  trapezoid  defined  by  four  or  fewer  lines.  The  second  change  has  to  do  with  the  fact  that 
in  order  to  maximize  the  number  of  incidences  all  points  Lie  on  the  Lines  rather  than  in  the  cells  of 
the  arrangement.  Since  we  do  not  count  edges  of  the  cells  we  can  ignore  the  contribution  of  zones 
altogether,  but  now  we  have  to  take  into  account  the  number  of  incidences  that  happen  on  the  Lines 
of  the  sample  arrangement.  We  address  the  first  issue  first  and  then  derive  the  upper  bound  with 
the  improved  constants  of  proportionality. 

A  crucial  assumption  in  the  probabilistic  counting  argument  is  that  a  trapezoid  is  a  funnel  of  a 
sample  arrangement  if  and  only  if  it  is  a  trapezoid  defined  by  four  or  fewer  Lines  of  the  sample  and  it 
does  not  intersect  the  other  lines  of  the  sample.  This  is  because  two  such  trapezoids  are  necessarily 
disjoint.  Notice,  however,  that  this  disjointness  property  is  lost  if  the  Lines  are  not  in  general  position. 
To  avoid  the  technical  difficulties  in  dealing  with  Lines  in  special  position  we  perturb  the  Lines  ever  so 
slightly  so  that  they  are  in  general  position.  The  trapezoid  defined  by  four  or  fewer  original  lines  is 
just  the  unperturbed  version  of  the  trapezoid  of  the  corresponding  perturbed  Lines,  and  we  say  that 
it  does  not  "intersect"  any  sample  lines  if  the  corresponding  perturbed  trapezoid  does  not  intersect 
any  of  the  perturbed  sample  lines.  Observe  that  the  perturbation  is  only  used  to  define  and  classify 
trapezoids  —  the  arrangement  of  the  problem  is  left  unperturbed  so  no  incidences  are  changed.  Thus 
the  claim  about  the  existence  of  a  sample  of  r  lines  with  certain  properties  discussed  in  (3)  above 
holds  for  degenerate  cases  as  well. 

We  can  now  go  ahead  with  the  analysis.  For  the  Canham  threshold  we  get  I(m,  n)  <  mn1  :  -  n 
since  no  two  points  Lie  on  two  common  Lines;  setting  a  =  t  =  2  and  using  remark  (L)  after  the 
bipartite  graph  lemma  (Lemma  4.1)  implies  the  bound.  Now  take  a  "good"  sample  R  Z  L  of  size  r. 
triangulate  A(R),  and  consider  the  funnels  A,  separately.  This  gives 

I{m,n)<  y(m,/!j'*-n,)r  m  +  rn, 

where  m,  is  the  number  of  points  in  the  ith  funnel  and  n,  is  the  number  of  Lines  that  cut  it.  The  term 
m  -  rn  on  the  right  side  accounts  for  possible  incidences  involving  points  that  lie  on  sample  lines14. 
Using  the  bound  from  the  probabilistic  counting  argument  and  ^^  n,  <  (5r  -  5)n  as  derived  earlier 
we  thus  get 

/(m,n)<2m( )  '     -  6rn  -  5n  ~  m. 

Lf  we  now  choose  r  =  %~2,z■m2|:in~x,3  +■  c'  (assuming  m  >  6n1' ;),  with  4  <  c'  <  5  so  that  r  is  integer. 

we  get 

ri  \  ^  n  *G     2/3  2/3  ,  >/7   2/3  2/3  ,  fa    '   -\ 

/(m,  n)  <  2  v  6m  '   n       +-  v  6m  '    n  '     r  (6c   -  o)n  -  m. 


Each  line  that  does  not  belong  to  the  sample  intersects  the  r  sample  lines  in  at  most  r  points  and  therefore 
contributes  at  most  r  incidences.  Each  sample  line  has  at  most  r  incidences  with  points  that  lie  at  vertices  ol"  the 
sample  arrangement.  Finally,  each  point  that  lies  on  a  sample  line  but  is  not  a  vertex  of  the  sample  arrangement  is 
incident  to  at  most  one  sample  line. 
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Thus, 

I[m,n)  <  3v6m2/3n:/3  +  25n  -  m. 

The  multiplicative  constant  of  the  leading  term,  3^6,  is  approximately  5.45.  When  m  <  6n1/2  we 
get  I(m.n)  <  ~n  from  the  Canham  threshold,  and  we  get  /(m,  n)  <  (£)  -j-  m  <  2m  if  m  exceeds 
(")  since  at  most  (£)  points  can  be  incident  with  more  than  one  line  each.  Thus,  if  we  replace  the 
additive  term  m  by  2m  in  the  above  upper  bound  for  I[m,  n),  then  it  holds  without  any  restriction 
on  the  value  of  m  relative  to  n.  In  summary  we  have  shown 

Theorem  3.1   For  arrangements  of  lines  in  the  plane  we  have 

A'(m,  n)  <  3v/H0m2/:!r!:/3  +  140n,    and 
/(m,  n)  <  3v/6m2/3n2/3  +  25n  +  2m. 


4     Canham-like  Upper  Bounds 

Notice  that  the  nature  of  our  partitioning  process  (exemplified  in  Section  3)  is  such  that  each  point 
is  passed  to  exactly  one  subproblem.  while  a  manifold  may  be  distributed  to  several  subproblems, 
according  to  the  number  of  funnels  it  cuts.  Thus  we  can  expect  that  in  a  subproblem  the  ratio  of 
manifolds  to  points  increases,  on  the  average.  We  use  this  effect  to  our  advantage  when  we  choose 
the  size  of  the  sample  which  determines  the  number  of  subproblems  created.  The  reason  is  that 
when  the  number  of  points  is  small  as  compared  to  the  number  of  manifolds,  we  can  use  a  different 
type  of  argument  to  bound  the  complexity  of  the  cells  containing  the  points. 

These  other  arguments  employ  techniques  from  extremal  graph  theory  on  the  bipartite  graph 
defined  by  the  incidences  between  the  points  or  cells  and  the  given  manifolds.  They  usually  yield 
bounds  of  the  form  I(m,  n)  or  A"(m,  n)  =  Ojmn1"1'1'  +  n),  for  some  small  integer  k  (Canham's 
original  bound  for  lines  can  also  be  cast  in  this  form).  Thus  if  the  number  of  points  is  really  small, 
say  m  =  0(nl/k),  compared  to  the  number  of  manifolds,  then  these  arguments  show  that  /(m,  a)  or 
K{m,  n)  =  0(a)  and  provide  the  basis  for  our  reduction. 

In  this  section  we  extend  Canham's  bound  for  lines  to  arrangements  of  more  general  curves  such 
as  circles  and  pseudocodes.  These  results  will  serve  as  bases  which  can  be  lifted  to  bounds  for 
general  values  of  m  using  the  techniques  exemplified  in  Section  3  and  to  be  described  fully  in  Section 
5.  Section  4.1  will  present  the  extremal  graph  lemma  which  will  be  sufficient  to  derive  the  Canham 
thresholds  for  incidences  (see  Section  4.2).  Additional  tools  from  elementary  geometry  and  topology 
will  be  needed  to  generalize  the  bounds  to  edges  bounding  cells  (see  Sections  4.3  through  4.5). 

4.1      A  Lemma  on  Bipartite  Graphs 

Let  Q  =  (iWuiV,  A)  be  a  directed  bipartite  graph  with  sources  in  M  and  sinks  in  .V,  that  is,  A/fi.V  =  0 
and  (/i,  i/)  G  A  only  if  p.  G  M  and  v  €  N.  Q  is  complete  if  A  =  M  x  .V  in  which  case  we  write 
Q  =  A'm>n,  m  =  \M\  and  n  =  \N\.  Another  bipartite  digraph  Q'  =  (.V/'C.V,  A')  is  a  subgraph  of  Q  if 
there  are  one-to-one  functions  /  :  M'  —  ,U  and  g  :  N'  —  N  so  that  (f(n'),  g(v'))  €  A  if  (/i'.f')  S  A'. 
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Clearly,  if  Q'  is  a  subgraph  of  Q  then  |.V/'|  <  M  and  .V  <  .V.  We  also  say  that  Q  contains  no 
K ,,t  if  A'j.t  is  not  a  subgraph  of  Q. 

The  following  lemma  establishes  upper  bounds  on  the  size  of  .4  depending  on  the  values  of  i  and 
t  such  that  Q  contains  no  K,,t.  The  lemma  is  a  variant  of  various  graph-theoretic  extremal  results 
obtained  by  Kovari,  Sos  and  Turan  [44;  and  Erdos  [26 j .  For  completeness  we  include  a  proof  of  the 
lemma. 

Lemma  4.1  (bipartite  graph  lemma).  Let  Q  =  (.V/Jj.V,  A)  be  a  bipartite  digraph  with  m  =  \[ 
sources  and  n  =  ,.Vj  sinks.  If  there  are  constants  s  and  (  so  that  Q  contains  no  K,,t  then  .4  = 
0(mnl-1/j  -  n)  and,  symmetrically,   .4!  =  0(nmI-1/e  -  m). 

Proof.  We  just  prove  the  first  upper  bound  —  the  second  follows  by  symmetry.  Let  us  define  an 
3-regular  hypergraph15  H  =  [M,H]  as  follows: 

for  every  1/61V  adjacent  to  Ml>M2< ••  •»/**  6  -W  we  add  [nufii, ...,/ta}  to  H . 

With  this  definition  the  same  3-tuple  can  be  added  to  H  two  or  more  times,  thus  H  is  reallv  a 
multi-hypergraph.  On  the  other  hand,  any  s  nodes  in  M  are  adjacent  to  at  most  t  —  1  common  nodes 
in  .V  by  assumption  which  limits  the  number  of  s-tuples  in  H  to  at  most  (t  -  l)(m). 

Let  now  n,  be  the  number  of  nodes  in  M  adjacent  to  i/,  G  N  for  L  <  t  <  n.  By  definition  of  H. 
V{  gives  rise  to  ("')  5-tuples  in  H  (which  is  0  when  a,  <  s).  Thus. 

i  (:)<-(-»(: 

From  this  we  get 

where  the  summation  extends  over  all  t  such  that  n,  >  s.  We  can  now  use  the  Holder- Minkowsky 
inequality  to  derive 

n 

Vn,  <  {s-  l)n-  Y,  (n«  ~(J-  1))  <(*-  i)n+nl-l/'{t  -  l)l'm. 

i=l  n,  >j 

This  proves  the  lemma  since  \A\  =  ^JLj  n,  and  s  and  t  are  constants  by  assumption. 

Remarks.  (1)  The  final  inequality  in  the  proof  of  the  bipartite  graph  lemma  holds  independent  of 
whether  or  not  s  and  t  are  constants.  We  thus  get  the  following  slightly  stronger  upper  bounds  on 
the  cardinality  of  A: 

\A\  <  min{mnl-l/'(t  -  l)1/j  -  n(s  -  1),  nml-^'(3  -  i)l/t  -  m{t  -  L)}. 

(2)  For  s  =  t  =  2  it  is  known  that  the  bound  in  the  bipartite  graph  lemma  is  asymptotically- 
tight  —  the  best  constants  have  been  derived  by  Reiman  '51;.   This  implies  that  the  first  bound  in 

A  hypergraph  is  a  pair  ( .Y.  V),  where  X  is  a  set  of  nodes  and  Y  is  a  set  of  subsets  of  V".  It  u  $-rtguiar  if  every 
subset  in  Y  contains  exactly  j  nodes. 


Combinatorial  Complexity  Bounds  for  Arrangements  15 

the  lemma  is  asymptotically  tight  if  s  =  2  as  long  as  t  is  a  constant.  No  matching  lower  bounds  are 
known  for  the  case  s  >  2. 

(3)  In  Section  6.2  we  will  show  a  generalization  of  the  bipartite  graph  lemma  where  Q  is  allowed 
to  contain  some  copies  of  A',,£  but  not  too  many.  This  generalization  will  prove  useful  when  we  count 
incidences  between  points  and  spheres  in  three  dimensions. 

4.2      Counting  Incidences 

The  bipartite  graph  lemma  can  be  used  to  obtain  upper  bounds  on  the  maximum  number  of  incidences 
between  m  points  and  n  curves  of  certain  kinds.  The  curves  that  we  consider  are  lines,  pseudolines. 
unit-circles,  (general)  circles,  and  pseudocodes.  We  do  not  want  to  imply,  however,  that  the  methods 
of  this  paper  are  restricted  to  these  curves  only.  The  notions  "pseudoline"1  and  "pseudocode"'  need 
some  explanation.  A  pseudoline  is  a  simple  curve16  unbounded  at  both  ends,  and  a  family  of 
pseudolines  has  the  property  that  any  two  pseudolines  in  the  set  intersect  in  exactly  one  point 
where  they  cross.  A  pseudocircle  is  a  simple  closed  (and  thus  bounded)  curve1',  and  a  family  of 
pseudocodes  is  a  set  of  such  curves  so  that  any  two  intersect  in  at  most  two  points  and  if  they 
intersect  in  two  points  then  they  cross  there. 

We  do  not  claim  that  all  of  the  following  bounds  are  original  —  in  fact  some  of  them  appear  at 
various  places  in  the  literature.  We  include  them  because  they  are  needed  for  our  developments  in 
Section  5. 

Canham  Threshold  4.2  The  maximum  number  of  incidences  between  a  set  M  of  m  points  and  a 
family  .V  of  n  curves  is 

(i)  0(mn"2  —  n)  and  0(nm1^2  +  m)  if  iV  is  a  family  of  lines,  pseudolines,  or  unit-circles,  and 
(ii)  0(mn2'3  —  n)  and  0(nm1^2  +  m)  if  .V  is  a  family  of  circles  or  pseudocodes. 

Proof.  We  discuss  lines  and  pseudolines,  unit-circles,  and  circles  and  pseudocodes  in  this  sequence. 
In  each  case  we  consider  the  directed  graph  Q  —  (A/'JJV,  A)  with  {y-.v)  S  A  if  y.  €  v. 

If  .V  is  a  set  of  Ones  or  pseudolines,  then  no  two  points  lie  on  two  or  more  common  lines  or 
pseudo-lines  which  Onplies  that  Q  contains  no  A*2,;.  The  bounds  follow  from  the  bipartite  graph 
lemma. 

The  case  where  N  is  a  set  of  n  unit-circles  is  slightly  less  trivial.  Any  two  points  lie  on  at  most 
two  common  unit-circles  which  shows  that  Q  contains  no  AS,3.  Oi  addition,  any  two  unit-circles 
intersect  in  at  most  two  points  which  implies  that  Q  contaOis  no  A'3,2  either.  The  bounds  follow 
readily  from  the  bipartite  graph  lemma. 

Finally,  when  .V  is  a  family  of  circles  or  pseudocodes,  then  Q  contains  no  A'3,2  because  no  three 
points  can  have  two  common  (pseudo-)  circles.  G 


18 A   simple  curve  is  the  graph  of  a  continuous  one-to-one  mapping  /  from  an  open  interval  to  the  plane.     It  is 
unbounded  at  both  ends  if  /(i)  tends  to  infinity  as  z  approaches  either  endpoint  of  the  interval. 

1   A  simple  closed  curve  is  the  graph  of  a  continuous  one-to-one  mapping  from  a  circle  to  the  plane. 
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Remarks.  (1)  Using  the  bipartite  graph  lemma  we  can  also  show  that  the  number  of  incidences  is 
0(nm1,2  -  m)  if  .V  is  a  family  of  arbitrary  curves,  provided  any  two  curves  intersect  in  at  most  some 
constant  number,  s  -  1,  of  points  so  the  corresponding  y  contains  no  A',. 2- 

(2)  It  is  interesting  to  observe  that  the  asymptotic  bounds  for  Lines  and  pseudolines  also  hold  for 
unit-circles  in  spite  of  the  fact  the  Q  (defined  for  unit-circles)  contains  many  AS./s.  This  shows  that 
it  is  useful  to  formulate  the  bipartite  graph  lemma  for  forbidden  A',,t  in  the  directed  sense  (that  is. 
without  implicitly  forbidding  A't,,  as  well  if  s  =  t).  Of  course,  we  could  also  partition  each  unit-circle 
into  a  northern  and  a  southern  half-circle  and  consider  two  digraphs,  one  for  the  northern  and  one 
for  the  southern  half-circles,  each  of  which  has  no  A':. 2- 

(3)  The  arguments  used  in  the  proof  of  Canham  Threshold  4.2  are  purely  combinatorial  —  it  is 
therefore  not  surprising  that  there  is  no  difference  between  lines  and  pseudolines  and  between  circles 
and  pseudocodes. 

(4)  The  only  obstacle  to  proving  0(mn1/:  -  n)  for  general  circles  is  that  two  points  can  lie  on  an 
arbitrary  large  number  of  common  circles18.  Besides  unit-circles  there  are  a  couple  of  other  natural 
cases  where  such  configurations  are  prohibited  thereby  leading  to  this  improved  bound.  Those 
include  sets  of  circles  where  the  maximum  number  of  collinear  centers  or  the  number  of  different 
radii  is  bounded  from  above  by  some  constant.  (See  also  remark  (3)  in  Section  5.4.) 

(5)  Canham  Threshold  4.2  implies  that  the  number  of  incidences  is  0(  n)  as  long  as  m.  the  number 
of  points,  is  0(n1,2)  in  the  case  of  lines,  pseudoLines,  and  unit-circles,  and  0(n1,3)  in  the  case  of 
(general)  circles  and  pseudocodes. 

4.3      Cells  in  (Pseudo-)  Line  Arrangements 

An  asymptotic  version  of  Canham's  bound  on  the  number  of  edges  bounding  m  cells  in  an  arrange- 
ment of  n  lines  can  be  obtained  from  the  bipartite  graph  lemma  using  the  fact  that  for  any  two  cells 
there  are  at  most  four  lines  that  contain  edges  of  both  cells.  This  is  because  any  two  disjoint  and 
convex  sets  have  at  most  four  common  supporting  lines,  and  cells  in  a  line  arrangement  are  certainly 
convex  and  disjoint.  It  is  also  true  that  any  two  cells  in  an  arrangement  of  pseudolines  share  at  most 
four  pseudolines  supporting  edges  of  both  cells  —  but  this  is  more  difficult  to  show.  We  prove  a 
bound  on  the  number  of  edges  bounding  m  cells  in  a  pseudoUne  arrangement  without  showing  that 
two  cells  share  at  most  four  pseudolines.  The  following  straightforward  topological  lemma  will  be 
handy. 

Lemma  4.3  Let  C  be  a  simple  curve  (either  closed  or  unbounded  at  both  ends)  that  is  partitioned 
into  three  (connected)  pieces.  There  exist  no  two  disjoint  connected  regions  so  that  both  lie  on  the 
same  side  of  C  and  touch  all  three  pieces  of  C. 

Proof.  Assume  that  two  such  regions,  R\  and  R.2,  exist.  We  can  draw  a  third  region.  i?3,  on  the 
other  side  of  C  so  that  R3  touches  all  three  pieces  of  C  (see  Figure  4.1).   But  this  implies  a  plane 


"Ln  fact,  a  result  of  Erdos   [24]  on  counting   the  number   of  different   distances   in  a  set  of  points  shows   that   this 
improvement  is  impossible  (see  remark  (2)  in  Section  5.4). 
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Figure  4.1:  Reduction  to  a  planarity  argument. 

embedding  of  A'3.3  since  the  three  pieces  of  C  can  be  contracted  to  points  and  connected  to  a  point 
each  in  flj,  R*,  and  R3  without  creating  an  intersection.  □ 

Lemma  4.3  will  repeatedly  be  used  in  arguments  about  (pseudo-)  circles  and  also  unit-circles  in 
Sections  4.4  and  4.5.  Again,  we  do  not  claim  originality  for  the  following  result. 

Canham  Threshold  4.4  The  maximum  number  of  edges  bounding  m  cells  in  an  arrangement  of 
n  Lines  or  pseudolines  is  0(mn1/2  +  n). 

Proof.  We  only  need  to  prove  the  statement  for  pseudolines.  First,  we  replace  each  pseudoline.  L 
by  the  two  (pseudo-)  half-planes  that  are  the  connected  components  of  E2  -  L  Let  N  be  the  set  of 
2n  half-planes  and  consider  Q  =  (XItiN,A),  where  XI  is  the  collection  of  m  cells  and  {fi,v)  £  .4  if 
cell  (i  is  contained  in  half-plane  u  and  the  boundary  of  1/  contains  an  edge  of  \i. 

We  show  that  Q  contains  no  A'2.3.  Assume  Q  does  contain  a  A'2,3.  Then  there  are  three  half-planes 
hi,  h2,  /13  and  two  cells  in  hi  0  Aj  n  /13  so  that  each  cell  touches  the  boundary  of  each  half-plane.  By 
definition  of  a  family  of  pseudolines.  /it  fl  /12  n  /i3  is  is  connected,  its  boundary  is  a  simple  curve, 
and  the  pseudoline  I,  bounding  h,  intersects  the  simple  curve  in  a  connected  piece  for  i  =  1.2.3. 
We  thus  have  a  decomposition  of  this  simple  curve  into  three  connected  pieces  so  that  both  cells 
touch  all  pieces.  But  this  contradicts  Lemma  4.3.  The  assertion  now  follows  because  each  [fi,  u)  £  .4 
contributes  one  edge  to  the  boundary  of  cell  \i.  D 

Remark.  Notice  that  the  graph  Q  defined  in  the  above  proof  can  contain  subgraphs  A',,2  for  5  up  to 
n  -  L  (even  in  the  case  of  lines).  This  keeps  us  from  using  the  second  bound  in  the  bipartite  graph 
lemma  to  prove  0(nm^2  +  m)  as  a  bound  on  the  number  of  edges  bounding  m  cells.  Nevertheless. 
this  bound  is  correct  and  a  proof  can  be  found  in  [15]. 

4.4      Cells  in  (Pseudo-)  Circle  Arrangements 

In  this  section  we  establish  Canham  thresholds  for  cells  in  arrangements  of  circles  and  pseudocodes. 
The  bounds  are  the  same  in  both  cases  so  we  can  restrict  ourselves  to  pseudocodes  which  are  more 
general  than  circles.  In  the  proof  we  replace  each  pseudocode,  c.  by  the  two  so-called  pseudodisks  that 
are  the  connected  components  of  E2  -  c.  First,  we  prove  a  technical  lemma  about  three  pseudodisks 
and  three  connected  regions. 
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Lemma  4.5  It  is  impossible  to  have  three  pseudodisks  d\,d2,dz  and  three  pairwise  disjoint,  con- 
nected regions  contained  in  d\  n  d-y^d^  so  that  each  region  touches  the  boundary  of  each  <f,.  z  =  1.  2.  3. 

Proof.  We  can  restrict  our  attention  to  simple  arrangements  since  a  non-simple  arrangement  can  be 
made  simple  by  adding  vertices,  edges,  and  cells,  and  thus  making  it  strictly  easier  to  find  such  three 
regions.  Consider  all  topologically  different  simple  arrangements  of  three  pseudocodes  as  shown  in 
Figure  4.2.  We  consider  three  cases  depending  on  the  shape  of  D  =  d t  n  d2  n  ^3. 


o 

o 
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Figure  4.2:    The  fourteen  different  simple  arrangements  of  three  pseudocircles.    The  dashed  lines 
indicate  regions  with  four  edges,  shading  is  used  for  disconnected  regions. 


Case  1.  D  is  connected  and  has  three  or  fewer  edges.  (This  includes  cases  where  D  is  not  simply- 
connected.)  Examples  are  all  cells  in  Figure  4.2  (a)  through  (n)  not  explicitly  mentioned  in 
Cases  2  and  3.  If  D  has  less  than  three  edges  then  there  is  nothing  to  prove.  Otherwise,  the 
existence  of  three  disjoint  regions  in  D  that  touch  all  three  edges  contradicts  the  non-planarity 
of  h'3,3. 

Case  2.  D  is  disconnected.  Examples  are  the  shaded  parts  in  Figure  4.2  (1),  (m),  and  In).  In  each 
such  case,  D  consists  of  two  simply-connected  components  bounded  by  three  edges  each.  By 
Lemma  4.3  each  component  can  contain  only  one  region  touching  all  three  edges  which  makes 
at  most  two  regions  altogether. 

Case  3.  D  is  connected  and  bounded  by  four  edges.  There  are  12  such  D's  indicated  by  dashed 
boundary  Lines  in  Figure  4.2.  In  each  case,  D  is  simply  connected.  Two  opposite  edges  belong 
to  a  common  pseudocode  (bounding  a  <it)  and  can  be  connected  by  a  simple  curve  that  lies 
completely  outside  D.  Thus,  the  existence  of  three  regions  in  D  each  touching  one  of  the  two 
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edges  that  belong  to  the  same  pseudocode  as  well  as  both  of  the  other  two  edges  contradicts 
the  non-planarity  of  A'3,3. 

The  three  cases  exhaust  all  possibilities  for  dx  n  <f2  n  d3.  rj 

Remarks.  (1)  Note  that  all  fourteen  different  arrangements  of  three  pseudocircles  can  be  realized 
by  three  circles  as  shown  in  Figure  4.2. 

(2)  A  possible  negative  point  of  the  above  proof  is  that  it  relies  on  a  (presumably)  exhaustive 
enumeration  of  all  simple  arrangements  of  three  pseudocircles.  Alternatively,  we  could  argue  directly 
that  D  can  only  assume  the  three  types  of  shapes  considered  in  Cases  1,  2,  and  3. 

We  use  Lemma  4.5  to  prove  the  main  result  of  this  section. 

Canham  Threshold  4.8  The  maximum  number  of  edges  bounding  m  cells  in  an  arrangement  of 
n  circles  or  pseudocircles  is  0(mn:/3  -  n). 

Proof.  Besides  Lemma  4.5  we  need  the  following  result  proved  (in  a  contrapositive  form)  in  '40;: 

the  intersection  of  k  >  3  pseudodisks  is  bounded  by  at  most  6k  -  12  edges. 

This  implies  that  the  number  of  edges  bounding  a  cell  in  an  arrangement  is  at  most  six  times  the 
number  of  (pseudo-)  circles  that  contribute  edges  to  the  boundary  of  the  cell. 

Let  now  M  be  the  set  of  m  cells  and  N  be  the  set  of  2n  pseudodisks  (two  for  each  pseudocode), 
and  consider  the  graph  Q  -  (M(jN,A)  with  (/x,  f)  £  ^  if  cell  /i  is  contained  in  pseudodisk  v  and  the 
boundary  of  u  contains  an  edge  of  ^.  By  Lemma  4.5  Q  contains  no  K3,z-  The  bound  stated  in  the 
assertion  now  follows  from  the  bipartite  graph  lemma  (Lemma  4.1). 

Remark.  Since  0(nml/2  -f  m)  is  an  upper  bound  on  the  number  of  incidences  between  m  points 
and  n  (pseudo-)  circles,  one  might  conjecture  that  the  same  bound  holds  for  the  number  of  edges 
bounding  m  cells.  This  may  be  true  but  the  currently  best  upper  bound  of  this  form  is  0(  A4(n)m1'2) 
which  follows  from  the  investigations  in  [18].  Here  A4(n)  is  superlinear  only  by  a  factor  that  depends 
on  the  inverse  of  Ackermann's  function  (see  Section  5.5). 

4.5     Cells  Defined  by  Unit-Circles 

For  m  cells  in  an  arrangement  of  n  unit-circles  we  can  prove  an  upper  bound  on  the  total  number  of 
edges  bounding  the  m  cells  that  is  asymptotically  smaller  than  the  bound  derived  for  general  circles 
(Canham  Threshold  4.6).  The  special  property  of  unit-circles  that  admits  this  improvement  is  that 

no  unit-circle  encloses  an  antipodal  point  pair19  of  another  unit-circle. 

This  property  implies  that  of  the  two  arcs  into  which  a  unit-circle  is  cut  by  another  unit-circle,  onlv 
the  shorter  arc  can  be  enclosed  by  the  other  unit-circle.  The  upper  bound  on  the  number  of  edges 


'Two  points  of  a  circle  are  antipodal  if  they  are  collinear  with  the  center  of  the  circle. 


Combinatorial  Complexity  Bounds  foe  Arrangements  20 

that  we  are  going  to  establish  is  0(mn1/:  4-  n)  (Canham  Threshold  4.7  below)  which  indicates  that 
we  will  use  a  forbidden  A'^.j  argument,  that  is,  no  two  cells  touch  t  common  unit-circles  (we  choose 
t  =  3).  This  is,  however,  not  true  in  general  for  any  t  as  shown  in  Figure  4.3.   We  get  around  this 


Figure  4.3:  Cells  cx  and  ci  touch  five  common  unit-circles. 

difficulty  by  considering  a  constant  number  of  different  cases  and  using  a  forbidden  A'2.3  argument 
for  each  but  one  case. 

Canham  Threshold  4.7  The  maximum  number  of  edges  bounding  m  cells  in  an  arrangement  of 
n  unit-circles  in  the  plane  is  0(mn"2  4-  n). 

Proof.  Let  M  denote  the  set  of  m  cells,  let  N  be  the  set  of  n  unit-circles,  and  define  y  -  ( A/l.V.  A  . 
with  (c,u)  €  A  if  cell  c  touches  unit-circle  u.  As  mentioned  above,  we  do  a  case-analysis  which,  in 
effect,  considers  Q  as  the  union  of  a  constant  number  of  subgraphs.  The  cases  that  we  distinguish 
are 

Case  1.  Q\  —  (AfUiV,  ,4;),  where  {c,u)  €  A\  if  c  touches  u  and  is  enclosed  by  u. 

Case  2.  Q0  =  (A/U:V.  .4„),  where  (c,  u.)  £  A0  if  c  touches  u  and  is  not  enclosed  by  u.  Note  that 
A  -  A\  u  A0.  We  distinguish  three  not  necessarily  disjoint  subcases. 

Case  2.1.  Cells  that  lie  outside  all  unit-circles  of  N. 

Case  2.2.  Pairs  of  cells  enclosed  by  at  least  one  common  unit-circle. 

Case  2.3.  Pairs  of  cells  so  that  for  each  cell  there  is  a  unit-circle  in  .V  that  encloses  it  but  not 
the  other  cell. 

We  next  discuss  these  cases  in  sequence.  It  turns  out  that  Cases  1  and  2.1  are  fairly  straight- 
forward and  that  Cases  2.2  and  2.3  are  more  complicated  than  one  might  hope.  To  simplify  the 
discussion  we  define  a  unit-disk  as  the  closed  disk  bounded  by  a  unit-circle. 

Case  1.  Here  we  show  that  Q\  contains  no  A'2.3,  that  is,  it  is  not  possible  that  two  cells  contained 
in  three  common  unit-disks  both  touch  all  three  bounding  unit-circles.  To  see  why  this  is  true 
for  unit-circles  (it  is  not  true  for  general  circles  as  can  be  shown  for  the  arrangement  of  Figure 
4.2  (m))  observe  that  the  boundary  of  the  intersection  of  three  unit-disks  consists  of  at  most 
three  edges.    For  suppose  that  d\  n  d2  <~i  ^3  (d,  bounded  by  the  unit-circle  u, )  has  two  edges 
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contributed  by  u2.  Then  dx  contains  both  those  edges  plus  one  of  the  other  two  edges  u2  is  cut 
into  by  ux  and  1x3.  Either  those  three  edges  contain  an  antipodal  point  pair  of  u2  or  the  three 
similarly  defined  edges  of  u2  contained  in  d3  contain  such  a  pair.  Thus.  Lemma  4.3  applies  and 
proves  the  rest. 

Case  2.1.  As  proved  in  40]  the  union  of  n  pseudodisks  is  bounded  by  at  most  6n  -  12  edges  if 
n  >  3.  Unit-disks  are  just  special  pseudodisks  and  thus  the  bound  applies  here  too.  Since 
6ra  -  12  is  certainly  within  0(mn1'2  +  n)  we  can  ignore  such  cells  from  hereon  altogether. 

Case  2.2.  Take  two  cells,  cx  and  C2,  contained  in  a  unit-disk  bounded  by  a  unit-circle  in  .V.  In 
contrast  to  Case  2.1.  both  cells  are  fairly  small  and  therefore  easier  to  control.  Our  goal  is 
to  prove  that  cx  and  c2  cannot  touch  three  unit-circles  and  lie  outside  all  three  of  them.  To 
prove  this  we  assume  the  contrary,  that  is,  there  are  three  unit-circles  ux,  u2,  u3  so  that  cx  and 
c?  touch  and  lie  outside  all  three.  Let  u0  be  the  unit-circle  that  encloses  cx  and  c2,  and  let  d, 
denote  the  unit-disk  bounded  by  U{,  0  <  i  <  3. 

Consider  the  Voronoi  diagram  of  the  centers  px,p2,p3  of  ux,u2,u3.  I*  consists  of  regions 
RX,R2,R3  so  that  fl,  =  {x  £  £2:d(z,p,)  <  d(x,p}),  j  56  1}  for  i  =  1,2.3  (see  Figure  4.4). 
Each  Rz  is  the  intersection  of  two  closed  half-planes  and  thus  convex.  The  nice  property  of  the 


Figure  4.4:    The  Voronoi  diagram  of  the  centers  of  ux,u2,u3.    Ln  the  hrst  case  the  union  of  the 
unit-disks  has  four  edges,  in  the  second  case  it  has  three. 

Voronoi  diagram  is  that  dx  U  d2  U  d3  =  (dx  n  Rx)  u  {d2  n  R2)  U  (<i3  D  R3)  which  amounts  to  a 
decomposition  of  dx  U  d2  U  ^3.  We  distinguish  two  subcases. 

Case  2.2.1.  u0  n  {di  n  R%)  £  0  for  t  =  1,2,3  (see  Figure  4.4,  left).  We  show  that  if  ct  and  c2 
exist  with  the  required  properties  then  we  have  a  plane  embedding  of  a  A'3.3  and  thus  a 
contradiction.  By  assumption,  cx  and  c2  are  contained  in  do  —  {d\  u  d2  Li  d3).  Since  both 
cells  touch  i»i,  u2,u3  they  also  touch  d0^dx  fl  Rx,dQr,d2 i"1  R2,dQnd3n  R3  which  are  three 
convex  regions  with  pairwise  disjoint  interiors  enclosed  by  u0.  Since  each  do  n  d,  ~  /?,, 
i  =  1,2,3,  touches  u0  by  assumption  we  can  construct  a  region  outside  d0  that  touches 
each  d0  n  dx  n  Rx.  Contracting  cx ,  c2,  and  the  four  regions  gives  a  plane  embedding  of  A'3.3. 

Case  2.2.2.  u0  n  (dx  n  Rx)  =  0  (see  Figure  4.4,  right).  We  argue  that  dx  0  d2  U  d3  is  simply 
connected  and  bounded  by  three  edges  only;  Lemma  4.3  will  prove  the  rest.  As  mentioned 
before,  the  union  of  the  three  unit-disks  is  equal  to  (dx  ^  Rx)  J  {d2  ~i  R2)  U  (^3  n  #3). 
Therefore,  the  union  is  simply  connected  and  bounded  by  only  three  edges  if  u,  n  Rt  is 
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connected  for  i  =  1,2,3.  Since  dx  n  Rx  is  convex  and  contains  the  center  of  ux,  U\  "  Rx 
contains  two  antipodal  points  of  ux  (which  is  impossible  since  ux  n  Rx  is  enclosed  bv  u0) 
unless  dx  contains  the  only  vertex  of  the  Voronoi  diagram  of  px.  p;,  and  p3.  This  vertex 
is  equally  far  from  each  of  the  three  centers  which  implies  that  all  three  d,  contain  the 
vertex.  Consequently,  u,  "I  Rt  is  connected  for  i  =  1,2.3. 

Case  2.3.  Here  we  consider  two  cells.  cx  and  c?,  in  the  arrangement  denned  by  .V  so  that  there 
are  u_i,u0  £  .V  with  C\  C  d_j  -  d0  and  c2  C  d0  -  d_t  (see  Figure  4.5).    It  follows  that  the 


Figure  4.5:  Cells  cj  and  c2  are  enclosed  by  two  different  unit -circles. 

perpendicular  bisector  of  the  centers  of  u_j  and  u0,  line  (..  separates  cx  and  c-t. 

We  decompose  each  unit-circle  into  a  constant  number  of  equally  large  and  pairwise  disjoint 
arcs,  called  caps.  The  goal  is  to  show  that  it  is  impossible  that  cx  and  ci  touch  the  same  caps 
of  three  unit-circles  ui,  U2,  U3  (the  three  caps  are  translates  of  each  other)  and  lie  outside  Ui, 
u-y,  and  uj.  This  decomposition  into  caps  corresponds  to  writing  .4„,  the  set  of  arcs  of  Q0,  as 
the  union  of  a  constant  number  of  arc  sets.  (Of  course,  we  consider  only  arcs  connecting  two 
cells  that  fall  into  Case  2.3.)  It  is  sufficient  to  consider  only  one  type  of  cap. 

Define  the  south-cap.  u,  of  a  unit-circle,  u,  as  the  cap  whose  midpoint  is  the  lowest  point  of  u 

and  whose  length  if  2a,  with 

2x       n  y/2  -  1 

a  =  —  5:  0.101  <  . 

31  4 

Thus,  u  is  decomposed  into  31  caps  one  of  which  is  u.   The  choice  of  a  implies  that  any  two 

south-caps  can  intersect  in  at  most  one  point  (this  is  true  for  any  a  <  x  2).  If  c>  and  ci  are  to 

touch  the  south-caps  of  ux,  u?,  U3  6  -V  so  that  cx  and  c2  lie  outside  the  three  unit-circles,  then 

fix,  U2i  "3  must  intersect  d_x  -  do  as  well  as  d0  -  d„x  and  therefore  they  must  intersect  line  t. 

We  distinguish  between  two  cases  depending  on  the  slope  of  I  which  we  assume  without  loss 

of  generality  to  be  non-negative. 

Case  2.3.1.  The  slope  of  I  is  at  least  1  (this  includes  the  case  where  t  is  vertical).  In  this 
case,  t  intersects  each  ut,  1 ;  =  1.  2,  3,  in  at  most  one  point.  Let  ux .  u?.  u3  be  the  sequence 
so  that  the  y-coordinates  of  ax  1  I,  u2  C\  (,  u3  1  (  are  in  increasing  order.  The  intersection 
d,  ("I  t,  1  =  1,2,3,  is  a  Line  segment  that  is  longer  than  v  2  -  2a  since  t  intersects  b,. 
This  implies  that  two  consecutive  intersections  of  t  with  fix,  uj,  £3  are  more  than  v  2  -  2a 
apart20.   Hence,  the  distance  between  ux  ~  I  and  H3  ~  t  is  greater  than  2\  2  -  4a.    Since 


T!ui  13  immediate  as  long,  as  the  slope  of  I  guarantees  that  u,_,  intersects  u,  in  at  most  one  point,  i  =  1.  2.   If  this  is 
so  then  <i,.i  ~  (  must  he  outside  u,  —  otherwise,  the  part  of  ii,.  i  on  one  side  of  (  is  enclosed  by  u,  and  can  therefore  not 
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the  length  of  a  south-cap  is  2a.  this  implies  that  no  two  points  of  &!  and  u3  are  within  a 
distance  2V2  -  8a  >  2  of  each  other.  We  see  that  no  cell  that  is  enclosed  bv  a  unit-circle 
can  touch  both  iii  and  u3. 

Case  2.3.2.  The  slope  of  t  is  in  [0.  Lj.  The  difference  to  Case  2.3.1  is  that  two  south-caps  can 
now  be  arbitrarily  close  to  each  other  without  immediately  contradicting  the  existence 
of  cells  C\  and  c2  touching  Ui,U2iUS<  Since  t  cannot  be  vertical  we  can  talk  without 
ambiguity  about  points  (vertically)  above  or  below  i.  Since  a  <  x/4.  the  center  of  any 
unit-circle  whose  south-cap  intersects  t  lies  above  I  and  the  south-cap  lies  below  the  line 
parallel  to  t  that  goes  through  the  center  of  the  unit-circle. 

We  first  examine  how  two  south-caps  can  intersect  relative  to  I.  As  noted  already,  they 
can  intersect  in  at  most  one  point.  If  t  intersects  u,  and  u;  in  one  point  each,  then  u, 
and  Uj  cannot  intersect  at  all  —  otherwise,  the  part  of  one  south-cap  on  one  side  of  I 
is  enclosed  by  the  other  unit-circle.  Even  if  I  intersects  one  or  both  south-cap(s)  in  two 
points  (each),  the  two  caps  cannot  intersect  below  I.  For  if  they  do.  the  complement 
of  d,  u  dj  above  I  is  simply  connected.  The  line  parallel  to  I  through  the  center  of  u, 
lies  above  the  two  caps,  and  any  cell  above  t  that  touches  d\  and  u2  intersects  this  line 
on  both  sides  of  ux  (see  Figure  4.6).    Thus,  the  line  contains  two  points  of  the  cell  at 


Figure  4.6:  Two  south-caps  cannot  intersect  below  I.  Here,  the  size  of  the  south-caps  is  exaggerated. 

distance  greater  than  2  from  each  other.  This  contradicts  the  requirement  that  the  cell  is 
enclosed  by  a  unit-circle.  Hence,  the  south-caps  can  be  sorted  from  left  to  right  by  their 
intersection(s)  with  I.  Let  u1,u2,  U3  be  the  order.  By  the  same  reasoning  as  above  a  cell 
above  I  that  touches  all  three  south-caps  must  intersect  the  line  parallel  to  L  through  the 
center  of  u2>  the  middle  unit-circle,  on  both  sides  of  u2.  Again,  we  reach  a  contradiction 
since  this  ceil  does  not  fit  inside  a  unit-circle. 

In  summary,  we  showed  that  Q\  contains  no  Ki.z  (Case  1)  and  that  Q0  can  be  written  as  the 
union  of  32  graphs  each  without  any  A'2,3  (Cases  2.2  and  2.3)  after  removing  all  cells  that  lie  outside 
all  unit-circles.  The  latter  cells  are  bounded  by  at  most  6rc  -  12  edges  (Case  2.1).  The  removal  of 
these  cells  from  <5„  therefore  decreases  the  number  of  arcs  by  at  most  this  number.  Finally,  we  note 
that  the  number  of  edges  of  a  cell  in  the  arrangement  denned  by  ,V  has  at  most  six  times  as  many 


be  touched  by  c\  or  cj.  There  is,  however,  the  case  that  u,»i  intersects  u,  in  two  points.  Straightforward  trigonometry 
and  elementary  geometry  shows  that  the  slope  of  I  in  this  case  is  at  least  4.T  and  that  the  distance  between  u,  "1  t  and 
u,.i  fl  I  must  therefore  be  greater  than  \fz  —  2a  just  to  ensure  that  u,*i  does  not  lie  completely  inside  d,. 
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edges  as  there  are  unit-circles  that  contain  those  edges31  (again  by  a  result  of  [40i).   This  together 
with  the  bipartite  graph  lemma  (Lemma  4.1)  implies  the  upper  bound  of  the  assertion. 


5      General  Upper  Bounds 

This  section  presents  the  details  of  our  approach  to  proving  general  bounds  for  the  incidences  problem 
and  the  many  faces  problem  which  improve  the  Canham-like  bounds  obtained  in  Section  4.  The 
method  is  fairly  modular  and  consists  of  three  major  steps  —  not  counting  Canham  thresholds 
necessary  to  get  the  method  off  the  ground.  At  the  heart  of  the  method  is  a  probabilistic  counting 
result  which  will  be  presented  in  appropriate  generality  in  Section  5.2.  It  relies  on  a  decomposition 
of  the  cells  in  an  arrangement  into  regions  of  constant  description.  We  call  the  decomposition  a 
triangulation  although  the  regions  (funnels)  are  not  necessarily  triangles.  Triangulations  are  the 
topic  of  Section  5.1.  The  third  component  of  the  method  is  the  analysis  of  zones  in  arrangements 
which  will  be  discussed  in  Section  5.5.  Putting  all  three  components  together,  we  prove  upper  bounds 
on  the  number  of  edges  of  many  cells  in  Section  5.6.  Upper  bounds  for  incidences  can  be  established 
without  the  use  of  zones.  These  will  be  discussed  in  Section  5.3  immediately  followed  by  applications 
to  counting  distances  given  in  Section  5.4. 

5.1      Triangulating  Arrangements 

The  first  tool  that  we  need  in  order  to  obtain  the  funneling  partition  is  the  ability  to  "triangulate"  the 
cells  of  an  arrangement,  that  is,  to  break  them  up  into  finer  pieces  so  that  each  piece  is  bounded  by 
only  a  constant  number  of  curves  or  surfaces.  As  we  already  remarked,  the  reason  for  this  refinement 
is  to  justify  the  probabilistic  analysis  that  we  use  to  prove  that  the  funneling  subdivision  does  a  good 
job  of  distributing  the  manifolds  among  the  funnels.  The  refinement  is,  in  general,  accomplished  by 
using  auxiliary  manifolds  from  the  same  class  as  the  manifolds  forming  the  arrangement,  or  from  a 
reasonably  related  class.  In  doing  so  we  wish  to  keep  the  total  number  of  funnels  we  obtain  fairly 
close  to  the  number  of  cells  of  the  sample  arrangement. 

This  section  discusses  only  the  two-dimensional  case,  and  we  refer  to  Section  6.3  for  an  extension 
of  our  triangulation  technique  to  three  dimensions.  Triangulating  an  arrangement  of  lines  is  fairly 
straightforward  and  is  discussed  in  sufficient  detail  in  Section  3.  We  therefore  confine  this  section 
to  pseudolines,  circles,  and  pseudocodes.  The  main  part  consists  of  the  analysis  of  a  particular 
triangulation  of  a  circle  arrangement.  Thereafter  we  extend  such  triangulations  to  pseudolines  and 
pseudocodes. 

Arrangements  of  Circles.  Let  C  be  a  set  of  n  circles  in  the  plane32.  Each  cell  of  the  arrangement, 
A(C),  is  decomposed  by  drawing  vertical  line  segments  through  the  left-  and  rightmost  points  of  all 
circles.  These  line  segments  are  extended  as  far  as  possible  so  that  they  intersect  no  other  circles.  In 
addition,  we  draw  a  maximal  vertical  (relatively  open)  line  segment  through  every  vertex  of  AiC) 
so  that  the  only  intersection  with  the  circles  is  this  vertex  (see  Figure  5.1).  A  funnel  (also  called  a 


As  a  matter  of  fact,  the  blow  up  factor  is  at  most  two  which  can  be  shown  using  Davenport-Schinzel  sequences  of 
order  2  Isee  e.g.  [S3j). 

22  We  will  not  distinguish  between  the  general  case  and  the  case  of  unit-circles. 
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Figure  5.1:  Triangulating  a  circle  arrangement  using  trapezoidal  funnels. 

trapezoid)  is  thus  bounded  by  four  edges:  two  vertical  edges  to  the  left  and  right  and  two  circle  edges 
at  the  top  and  the  bottom.  Up  to  three  of  these  four  edges  can  be  degenerate,  that  is.  empty  or  at 
infinity  (see  Figure  5.1). 

In  order  to  count  the  number  of  funnels  generated,  we  perform  a  sweep-Line  argument  where  the 
line  is  vertical  and  sweeps  from  left  to  right.  Initially  (when  the  Line  is  to  the  left  of  all  circles),  it 
intersects  one  funnel.  Whenever  the  sweep-line  passes  the  leftmost  point  of  a  circle  it  encounters 
three  new  funnels  (assuming  the  circles  are  in  general  position),  and  when  it  passes  a  rightmost  point 
it  encounters  one  new  funnel.  When  the  line  passes  a  vertex  which  is  the  intersection  of  two  circles 
then  it  hits  three  new  funnels.  This  gives  an  upper  bound  of 


1  -i-  Zn  +  n  4-  6| 


=  3  n*  +  n  4-  1 


for  the  number  of  funnels  in  the  triangulation*3. 

Remarks.  (1)  The  decomposition  of  the  cells  into  "trapezoidal"  funnels  as  described  is  not  a  cell 
complex  since  quite  frequently  we  have  a  vertex  of  one  funnel  lie  in  the  middle  of  an  edge  of  an 
adjacent  funnel.  This  will  have  no  consequences  in  our  analysis. 

(2)  The  line-sweep  counting  argument  shows  that  the  majority  of  the  funnels  are  due  to  inter- 
sections of  circles.  In  fact,  if  we  define  t  as  the  number  of  intersecting  pairs  of  circles  in  C,  then  the 
number  of  funnels  is  at  most  6t  +  4n  +  1. 

Arrangements  of  Pseudolines  and  Pseudocodes.  The  decomposition  of  the  cells  into  trape- 
zoidal funnels  as  described  for  lines  in  Section  3  and  for  circles  above  can  readily  be  extended  to 
pseudolines  as  well  as  pseudocodes.  There  is,  however,  one  important  difference.  A  pseudoline  as 
well  as  a  pseudocircle  can  have  arbitrarily  many  jt -extreme  points*4,  and  we  would  have  to  draw  a 
maximal  vertical  line  segment  through  each  .r^-extreme  point.  In  order  to  avoid  complications  caused 
by  such  additional  separations,  we  assume  from  now  on  that  a  pseudoline  intersects  any  vertical  line 
in  at  most  one  point,  and  that  a  pseudocircle  intersects  any  vertical  Line  in  at  most  two  points. 


"This  analysis  ignores  cases  such  as  vertices  that  lie  on  more  than  two  circles  and  vertices  that  are,  at  the  same 
time,  left-  and/or  rightmost  points  of  circles.  Using  a  straightforward  perturbation  argument  one  can  show,  however, 
that  the  occurrence  of  such  cases  only  decreases  the  number  of  funnels  that  are  constructed. 

24  A  point  p  of  a  pseudoline  or  pseudocircle  v  is  ii-extreme  if  all  points  on  v  in  an  f-neighborhood  of  p  have  greater 
(or  smaller)  zi -coordinates  than  p. 
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Remarks.  (1)  For  a  pseudoline  arrangement  it  is  possible  to  avoid  the  additional  assumption  of  no 
*i-extreme  points  using  Levi's  lemma   45]: 

Let  A(L)  be  a  pseudoline  arrangement  and  p  and  q  be  any  two  points  not  both  on  the 
same  pseudoline.  Then  there  is  a  pseudoline,  L  passing  through  p  and  q  so  that  Ai  L  _  {(}) 
is  still  a  valid  pseudoline  arrangement. 

With  this  we  can  draw  a  piece  of  a  pseudoline  between  any  two  non-adjacent  vertices  of  a  cell 
(of  the  partially  triangulated  arrangement)  that  has  more  than  three  edges.  Every  funnel  of  this 
triangulation  is  bounded  by  at  most  three  edges  and  each  edge  intersect  every  pseudoline  in  at  most 
one  point  where  it  crosses  it.  For  reasons  of  uniformity  we  prefer  to  work  with  the  triangulation 
using  trapezoidal  funnels,  though. 

(2)  We  pose  the  generalization  of  remark  ( 1)  to  arrangements  of  pseudocircles  as  an  open  problem. 
Let  C  be  a  finite  set  of  pseudocircles  and  A(C)  the  arrangement  defined  by  C.  Prove  or  disprove 
that  for  any  two  points  p  and  q  there  is  a  pseudocode  c  so  that  p, q  G  c  and  A(C  J  {c})  is  still  a 
pseudocircle  arrangement. 

A  crucial  property  of  the  triangulations  described  in  this  section  is  a  certain  independence  prop- 
erty that  allows  us  to  decide  whether  a  trapezoid  is  part  of  a  triangulation  without  looking  at  its 
neighboring  trapezoids.  This  property  will  play  an  important  role  in  the  probabilistic  counting 
analysis  of  triangulations  in  the  next  section. 

Lemma  5.1  (independence  lemma).  Let  R  be  a  set  of  r  curves  (that  is.  lines,  pseudolines.  unit- 
circles,  circles,  or  pseudocircles).  A  trapezoid  A  is  in  the  triangulation  of  A(R)  if  and  only  if 

(i)  A  is  in  the  triangulation  of  A(R')  for  some  R'  C  R,  ]R"  <  4,  and 
(ii)  A  does  not  intersect  any  curve  in  R. 

Proof.  The  "'only  if"  part  is  straightforward  since  a  trapezoid  A  in  the  triangulation  of  A{  R)  satisfies 
(ii)  by  definition.  Furthermore,  the  top  and  bottom  of  A  lie  in  two  curves,  by  construction,  and  two 
additional  curves  suffice  to  determine  the  left  and  right  sides  of  A.  Thus.  A  is  in  the  triangulation  of 
these  at  most  four  curves.  (There  are  less  than  four  if  a  side  is  empty  or  at  infinity:  see  also  Figure 
3.2.) 

To  see  the  "'if"  part  notice  that  a  trapezoid  A  that  satisfies  (i)  and  (ii)  does  not  intersect  anv 
side  used  in  the  triangulation  of  A{R):  no  non-vertical  side  can  intersect  A  because  A  is  disjoint 
from  all  curves  in  R,  and  no  vertical  side  can  meet  A  because  every  vertex  in  A(R)  is  shielded  from 
A  by  at  least  one  of  the  curves  in  R'.  Thus,  A  is  contained  in  a  trapezoid  of  the  triangulation  of 
A{R).  But  then  this  trapezoid  must  be  equal  to  A  since  it  intersects  a  line  in  R',  otherwise.  Z 

Remark.  A  trapezoid  A  satisfying  (i)  is  said  to  be  defined  by  R'.  The  smallest  set  R'  that  defines 
A  is  unique  if  we  assume  that  the  curves  in  R  are  in  general  position. 
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5.2      Probabilistic  Counting  Results 

The  key  intuition  behind  our  partitioning  scheme  is  that  we  are  likely  to  do  well  if  we  partition  based 
on  sampling  our  own  data.  How  well?  For  any  given  R  of  size  r,  on  the  average  we  expect  roughly 
0(n/r)  manifolds  to  cut  each  funnel.  Loosely  speaking,  the  reason  is  that  a  manifold  will  typically 
cut  on  the  order  of  rd~l  funnels  and  there  are  on  the  order  of  rd  funnels  altogether.  The  ;-net  theory 
of  Haussler  and  Welzl  [37],  as  well  as  the  probabilistic  lemma  of  Clarkson  [ll!  then  guarantee  that, 
with  high  probability  over  the  random  selection  of  R.  every  funnel  in  the  funneling  subdivision  will 
behave  like  the  ■'average'1  funnel,  to  within  an  O(logr)  factor.  Since  we  want  a  best-possible  bound 
we  need  a  tighter  result  here.  We  invoke  another  result  also  reported  in  more  generality  in  Clarkson 
1 12!  that  states  that  if  we  let  the  sample  R  vary  the  expected  number  of  manifolds  cutting  this  funnel 
will  be  0(n/r),  with  no  logarithmic  factor.  We  use  this  fact  and  the  additivity  of  expectation  to 
obtain  our  result.  Recent  results  of  Chazelle  and  Friedman  [7]  and  Matousek  [47]  show  that  there 
exist  triangulations  for  which  every  funnel  is  like  the  average  to  within  a  constant  factor.  This 
stronger  bound  is  not  necessarv  to  obtain  our  combinatorial  results. 


'-5  - 


The  presentation  of  the  probabilistic  counting  result  in  this  section  is  general  enough  to  cover 
the  cases  of  lines,  pseudolines.  unit-circles,  (general)  circles,  and  pseudocircles.  Even  more  cases 
(including  spheres  in  three  dimensions)  are  included  in  the  more  general  but  also  more  involved 
treatment  in  [12]. 

So  let  C  be  a  set  of  n  curves  (of  a  type  listed  above)  and  let  P  be  a  set  of  m  points.  For  a 
subset  R  C  C  we  triangulate  the  arrangement  A{R)  and  denote  the  funnels  by  Ai  through  A*. 
Define  m,  =  \M  D  A,j  and  let  n±  be  the  number  of  curves  that  intersect  A,.  For  the  time  being 
we  assume  that  the  curves  satisfy  appropriate  general  position  requirements  and  that  the  points 
avoid  all  curves  as  well  as  vertical  funnel  sides.  With  these  definitions  and  assumptions  we  have  the 
following  preliminary  result. 

Lemma  5.2  Let  R  be  a  set  of  r  curves  in  the  plane  and  let  Qn  be  the  set  of  all  trapezoids  defined 
by  four  or  fewer  curves  in  R. 

(i)  At  most  four  trapezoids  in  Qs.  contain  some  fixed  point  p  and  intersect  exactly  one  curve  each, 
(ii)  The  number  of  trapezoids  in  Qr  that  intersect  exactly  one  curve  in  R  is  0(r2). 

Proof.  To  show  (i)  notice  that  only  one  trapezoid  in  £r,  we  call  it  Ah,  contains  p  and  meets  no 
curve  in  R.  Let  R'  be  the  set  of  at  most  four  curves  that  define  Ar.  A  trapezoid  A  chat  satisfies  the 
conditions  in  (i)  has  the  property  that  it  is  Ar_{cj  for  exactly  one  curve  c  £  R-  But  Ar_{cj.  =  Ar 
unless  c  £  R'-  The  claim  follows  since  there  are  at  most  four  curves  c  in  R' . 

It  is  slightly  more  complicated  to  establish  (ii).  Take  a  curve  c  £  R  and  consider  the  (unique) 
triangulation  of  A(R  —  {c}).  We  count  the  number  of  trapezoids  in  this  triangulation  that  meet 
c.  This  is  equal  to  (or  one  more  than)  the  number  of  points  where  c  meets  trapezoid  sides.  The 
number  of  such  points  that  lie  on  non-vertical  sides  is  O(r)  because  c  intersects  the  other  curves  in 
at  most  two  points  each  by  assumption.  When  we  take  the  sum  over  all  c  £  R  we  get  0(r2)  such 
intersections.  Each  vertical  side  of  a  trapezoid  in  the  triangulation  of  A(R  -  {c})  that  intersects 
c  overlaps  with  a  vertical  side  in  the  triangulation  of  A(R)  that  has  one  endpoint  on  c  at  a  point 
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where  c  does  not  intersect  any  other  curve.  It  is  fairly  difficult  to  bound  the  number  of  such  vertical 
sides  for  an  individual  c,  but  it  is  easy  to  compute  the  sum  over  all  c  £  R.  Namelv.  everv  vertex 
in  A(R)  and  every  point  with  vertical  tangency  generates  two  vertical  sides  each  footing  at  a  single 
curve  c.  By  assumption,  there  are  only  0(r:)  points  generating  vertical  sides  which  impLies  that  the 
total  number  of  such  intersections,  summed  over  all  c  E  R.  is  0(r2).  — 

Remark.  With  a  little  more  care  the  above  proof  can  be  used  to  compute  the  constants  hidden  in 
the  big- Oh  notation.  For  lines  and  pseudolines  we  have  Q  vertices  in  A{  R)  and  no  points  of  vertical 
tangency.  Thus  there  are  at  most  r2  +■  2Q  <  2r2  trapezoids  in  Or  that  meet  exactly  one  line  or 
pseudoline  in  R.  For  unit-circles,  general  circles,  and  pseudocodes  we  have  2(j)  vertices  in  .41  R)  and 
2r  points  of  vertical  tangency.  It  follows  that  the  number  of  trapezoids  in  Qr  that  intersect  exactlv 
one  curve  in  R  is  at  most  2r(r  -  1)  -  4Q  -  4r  =  4r2. 

We  are  now  ready  to  present  the  probabilistic  counting  argument  which  leads  to  a  result  that 
says  in  a  formal  way  that  every  funnel  intersects  0(rc, r)  curves  on  the  average.  We  use  k,  m,,  and 
rz,  as  defined  above. 

Lemma  5.3  (sampling  lemma).  For  every  0  <  a  <  1  and  4  <  r  <  n  there  is  a  subset  R  Z  C  with 
r  =    R\  so  that 

(i)  E*=i"K*?  =0(m(?)a)and 

(")  E*»i  *  =  O(m). 

Proof.  The  proof  consists  of  two  steps  each  applying  a  probabilistic  counting  argument.  The  first 
step  shows  that  the  expectation  of  JTA.  m,n" ,  taken  over  all  subsets  R  of  size  r,  is  bounded  from 
above  by  6\m(^z^)  for  some  constant  Si.  It  follows  that  there  exists  a  subset  R  for  which  this 
inequality  holds.  The  second  step  shows  that  the  expected  value  of  £a  i,  <  ^  ^r  for  some  constant 
ii.  Now,  the  subset  that  satisfies  the  first  inequality  may  not  satisfy  the  second  and  vice  versa. 
However,  by  the  nature  of  the  argument  we  know  that  more  than  50%  of  all  subsets  R  of  size  r 
satisfy  £a<  rntnf  <  2Sxm[^^)  ,  and,  by  the  same  token,  more  than  50%  of  the  subsets  R  satisfy 
Ha,  a<  ^  ^2~-  ^  follows  that  there  is  at  least  one  subset  R  that  satisfies  both  inequalities. 

Step  1.  As  indicated  above,  the  idea  of  the  proof  is  that  if  R  is  chosen  at  random,  then  the 
expected  value  of  ^^  m^n*  is  0(m(n/r)a).  To  compute  the  expected  value  of  the  above  random 
variable,  we  observe  that  the  sum  is  the  same  as  3Z7=i  9j*'  wnere  <7;  iS  l^-e  number  of  curves  intersecting 
the  funnel  that  contains  point  p;  t  P-  Clearly.  q}  =  a,  if  p:  £  A,.  Since  the  expectation  is  additive 
we  can  concentrate  on  showing  that  the  expected  value  of  q*  denoted  by  E  q* '',  is  0((n  r)a).  By 
Jensen's  inequality  (see  for  example  [30]),  E[q*\  <  E[qj]a,  so  it  remains  to  show  that  E'q/  -  0!  n   r). 

As  before  we  write  C  =  {ci,c?, . .  .,cn}  for  the  set  of  curves  and  we  let  R  be  a  set  of  r  >  4 
sample  curves,  chosen  randomly  from  C.  Let  also  p  be  a  fixed  point  in  the  plane,  not  on  any  of  these 
curves  and  not  vertically  aligned  with  any  vertex  of  the  arrangement  .4(i2).  We  use  17  for  the  random 
variable,  depending  on  R,  that  counts  the  number  of  curves  which  intersect  the  funnel  containing  p. 

As  described  in  the  previous  section,  we  triangulate  A{  R).  decomposing  each  cell  into  trapezoidal 
funnels.   We  are  interested  in  the  particular  funnel  Ar  that  contains  p.   As  observed  in  Section  5.1 
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every  funnel  in  the  triangulation  of  A(R)  is  a  funnel  in  the  triangulation  of  some  four  or  fewer  curves 
of  R.  This  observation  suggests  that  we  could  find  Ar  in  the  following  roundabout  way:  for  all 
R'  C  R  with  \R'\  <  4,  find  the  trapezoid  of  the  triangulation  of  A(R')  that  contains  p.  The  result 
is  a  set  of  0(r4)  trapezoids,  which  we  denote  Tr.  (Observe  that  Fr  is  a  set  which  implies  that 
it  contains  a  trapezoid  only  once,  even  if  it  is  defined  by  more  than  one  set  R'.)  We  know  that 
Afl  6  -Ffl,  and  by  the  independence  lemma  (Lemma  5.1)  A#  is  the  unique  trapezoid  in  7r  that  does 
not  meet  any  curves  of  R. 

To  continue  the  proof  define  Tc  like  Tr  so  that  it  contains  all  trapezoids  of  the  arrangement 
of  some  four  or  fewer  curves  of  C  that  contain  the  point  p.  Furthermore  let  |A|  be  the  number  of 
curves  of  C  that  meet  trapezoid  A,  and  let  P±  be  the  probability  that  A  is  A/?,  that  is, 

P±  —  Prob[A  =  A/?  under  the  assumption  that  p  £  A]. 

With  these  definitions  E[q]  =  ^Ae^c  l^l^-i-  To  show  that  E[q\  is  0(n/r),  we  will  determine  P\. 
Since  all  subsets  R  of  size  r  are  equally  Likely.  P\  is  the  number  of  subsets  R  such  that  A  =  \r 
divided  by  (™),  the  number  of  subsets  R  of  size  r.  We  can  thus  compute  P±  by  determining  how  many 
subsets  R  have  A  =  A/?.  By  the  independence  lemma,  the  following  two  conditions  are  necessary 
and  sufficient  for  A  to  be  Ar:  A  is  in  Tr  and  none  of  the  |AI  curves  meeting  A  are  in  R.  Recall 
that  A  is  in  the  triangulation  of  some  four  or  fewer  curves  of  C.  and  using  the  general  position 
assumptions  about  C,  there  is  a  set  R±  C  C  with  \R±\  =  r±  <  4  such  that  A  6  Tr  if  and  only 
if  R±  C  R.  To  put  A  in  !Fr  we  must  choose  these  r^  curves.  To  satisfy  the  second  condition,  the 
remaining  r  -  r±  curves  must  be  chosen  from  the  n  -  |A|  -  r^  curves  that  are  not  in  R±  and  do  not 
meet  A.  Any  such  choice  will  do,  so  we  have 


Now  since 


we  have 


^^ei<j*:?)/(;) 


To  complete  the  proof  of  the  estimate  for  E[q],  it  is  enough  to  show  that  the  sum  in  this  expression  is 
bounded  from  above  by  a  constant.  The  key  observation  here  is  that  the  summand,  !  A! 


iA,-ral 
-I 


A 


is  the  probability  that  A  is  a  trapezoid  in  Fr  that  meets  exactly  one  curve  of  R.  This  observation 
follows  from  an  argument  much  like  that  giving  P±:  after  choosing  the  r^  curves  defining  A.  we  have 
to  choose  exactly  one  curve  from  the  set  of  |AI  curves  meeting  A,  and  r  -  r±  -  1  of  the  remaining 
ones.  The  sum  is  the  expected  number  of  trapezoids  in  Fr  that  meet  exactly  one  curve  of  R.  By 
Lemma  5.2  (i)  the  expectation  is  at  most  four. 

This  completes  our  proof  that  for  any  point  pj  £  P, 

WlS^]-<(^j)a<4-(^)". 
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Step  2.  The  main  flow  of  the  argument  for  part  (ii)  of  the  lemma  is  the  same  as  in  Step  1.  We 
replace  J^r  and  Fc  by  Qr  and  Qc-  where  Qr  is  the  set  of  ail  trapezoids  defined  by  four  or  fewer  curves 
in  R,  and  Qc  IS  the  same  for  C.  Furthermore,  we  replace  P±  by  Q^,  where  Q±  is  the  probability 
that  A  €  Qc  is  a  funnel  in  the  triangulation  of  A( R).  Finally,  define  \r  =  *£,nt  where  the  sum  is 
taken  over  all  funnels  of  the  triangulation  of  A(R).  With  these  definition  we  have 

E[xr]  =    £     A!0A. 

We  can  compute  Q±  by  observing  that  A  is  a  funnel  in  the  triangulation  of  A(R)  if  and  oniv  if 
A  6  Qr  and  A  does  not  meet  any  curve  in  R.  So 


exactly  the  same  as  P±  only  that  Q±  is  defined  for  ali  trapezoids  in  Qc-  Thus  we  have 

n -  |A|  -  r^\    / ( n^ 


E[XR\  =   E'^^E   W 


Here,  the  summand  is  the  probability  that  A  is  a  trapezoid  in  Qr  that  meets  exactly  one  curve  in 
R.  Thus,  the  sum  on  the  right  side  is  the  expected  number  of  trapezoids  in  Qr  that  intersect  exactlv 
one  curve  in  R.  By  Lemma  5.2  (ii)  this  number  is  0(r2)  which  implies  part  (ii)  of  the  assertion.    ~ 

Remark.  The  following  stronger  upper  bounds  on  the  sums  in  the  sampling  lemma  are  implied 
by  the  above  proof  and  the  remark  after  Lemma  5.2.    £*_j  rn,n?   <  4am(^7)     holds  for  lines. 

pseudolines.  unit-circles,  general  circles,  and  pseudocodes.    £i=l  nt   <    ^~  is  true  for  lines  and 

pseudolines,  and  3I.=i  n«  <  717  holds  for  for  unit-circles,  general  circles,  and  pseudocodes. 

5.3     Upper  Bounds  on  Incidences  in  the  Plane 

Bounds  on  the  maximum  number  of  incidences  between  m  points  and  n  lines,  pseudolines,  unit- 
circles,  general  circles,  and  pseudocircles  have  been  established  in  Section  4  (Canham  Threshold 
4.2).  We  now  use  the  machinery  provided  in  Sections  5.1  and  5.2  (as  well  as  Canham  Threshold  4.2 
itself)  to  improve  these  bounds  for  a  wide  range  of  values  of  m  depending  on  n.  In  each  case  we 
start  with  a  sample  of  the  curves,  triangulate  the  sample  arrangement,  and  finally  do  the  necessary 
calculations  to  obtain  the  result.  Beyond  the  fact  that  the  argument  is  the  same  in  each  case,  even 
the  calculations  are  identical  for  lines,  pseudolines.  and  unit-circles  and  for  circles  and  pseudocircles 
since  the  Canham  thresholds,  the  triangulation  results,  and  the  probabilistic  counting  bounds  are 
asymptotically  the  same.  In  view  of  the  fact  that  the  argument  and  the  calculation  of  the  final 
result  for  lines  has  been  given  in  Section  3,  we  can  as  well  omit  the  discussion  of  the  first  three  cases 
altogether.  The  remainder  of  this  section  gives  the  complete  derivation  of  the  improved  bounds  for 
circles  and  pseudocircles  and  finally  summarizes  the  results.  We  phrase  the  analysis  for  circles. 

To  get  the  analysis  for  circles  started  we  randomly  choose  r  of  the  a  circles  and  triangulate  the 
arrangement  defined  by  the  r  circles.  It  consists  of  k  =  0(r2)  funnels  which  we  index  from  L  through 
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k.  Let  m,  be  the  number  of  points  in  the  ith  funnel  and  rc,  be  the  number  of  circles  that  intersect  the 
ith  funnel25.  If  I{m.  n)  denotes  the  maximum  number  of  incidences  between  m  points  and  n  circles, 
we  have 

k 

I(m,  n)  <  ^  I{mt,  nr)  -r  m  +  2nr. 

t=i 

Here  the  sum  covers  the  number  of  incidences  within  the  funnels  and  the  term  2nr  -j-  m  bounds  the 

number  of  incidences  between  points  that  lie  on  sample  circles  and  all  n  given  circles.  Indeed,  each 

circle  c  intersects  the  sample  circles  (other  than  c  if  it  is  itself  in  the  sample)  in  at  most  2r  points, 

so  it  can  contribute  at  most  2r  incidences  with  points  that  lie  on  (other)  sample  circles.   The  only 

incidences  that  are  not  accounted  for  by  this  argument  involve  incidences  between  sample  circles 

and  points  that  are  not  vertices  of  the  sample  arrangement,  but  there  are  at  most  m  such  incidences 

total.  The  probabilistic  counting  argument  in  Section  5.2  implies  that  there  is  a  sample  so  that  the 

average  n,  is  0(n/r)  and  £f=1  mxnil     =  0(m(rc/r)2/'3).    Using  Canham  Threshold  4.2(ii)  we  thus 

get 

k 

I(m,  n)  =  y    0(mtn"      -j-  nt)  -  m  +■  2nr  =  0[  mn"   r     '    ±m  +  nr). 

■=i 

We  now  choose  r  —  0(m3'5n-l/'5)  which  is  feasible  as  long  as  n1^3  <  m  <  n2  and  obtain 

I(m.n)  =  0(m3/5n4/s) 

for  this  range  of  m.  We  have  I(m,n)  =  0(n)  if  m  <  nl^z  and  /(m,  n)  -  0(m)  if  m  >  rr .  A 
combination  of  these  bounds  gives  the  desired  result.  The  above  analysis  applies  almost  verbatim  to 
the  case  of  lines,  pseudolines,  and  unit-circles,  the  main  difference  being  that  we  have  to  use  Canham 
Threshold  4.2(i)  in  which  case  the  choice  of  r  is  as  in  Section  3.  We  thus  have 

Theorem  5.4  (planar  incidence  theorem).  The  maximum  number  of  incidences  between  a  set  M 
of  m  points  and  a  set  .V  of  n  curves  is 

(i)  0(m2/'3n2/3  ■+-  m  +■  n)  if  N  Is  a  set  of  lines,  pseudolines,  or  unit-circles,  and 
(ii)  0(m3/5n4/s  ■+-  m  -f  n)  if  N  is  a  set  of  n  circles  or  pseudocodes.26 

Remarks.  (1)  The  bound  for  lines  is  not  new  and  is  originally  due  to  Szemeredi  and  Trotter  !56j 
who  also  give  a  number  of  applications  to  related  geometric  problems.  This  bound  is  tight. 

(2)  Also  the  upper  bound  for  unit-circles  is  not  new  and  can  be  found  in  [54].  In  contrast  to  the 
case  of  lines,  no  matching  lower  bound  is  known  and  in  fact  it  is  conjectured  that  the  bound  in  (i)  is 
not  tight  (see  Erdos  [24]).  It  is  worthwhile  to  remark  that  the  same  bound,  0(m2/3n2/3  -r  m-t-n),  also 


25 Recall  that  a  cell  as  well  as  a  funnel  is  an  open  set.  Since  we  can  rotate  a  configuration  of  circles  and  points 
arbitrarily  we  may  assume  that  no  point  lies  on  a  vertical  side  of  a  funnel.  Thus,  if  a  point  lies  on  the  boundary  of  a 
funnel  it  must  necessarily  lie  on  at  least  one  sample  circle. 

8  It  is  implicit  that  we  make  the  claim  only  for  pseudolines  that  meet  any  vertical  line  in  one  point  and  for  pseudoco- 
des that  meet  any  vertical  line  in  at  most  two  points.  Since  any  pseudoline  arrangement  is  combinatorially  equivalent 
to  one  where  all  pseudolines  satisfy  this  condition  -  which  can  be  shown  e.g.  by  using  circular  sequences  ;32,  13j  -  this 
is  no  loss  of  generality  in  the  case  of  pseudolines. 
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holds  in  the  more  general  case  when  at  most  a  constant  number  of  circles  intersect  in  two  common 
points  (see  remark  (4)  after  Canham  Threshold  4.2). 

(3)  The  upper  bound  for  circles  is  new  and  is  an  improvement  of  the  0(  m3  4n3  *  -  m  -  n)  bound 
observed  in  5.  10  .  It  can  be  used  to  improve  known  bounds  for  combinatorial  distance  problems 
in  the  plane  (see  Section  5.4).  As  for  unit-circles,  no  matching  lower  bound  is  known  and  it  seems 
unlikely  that  our  bound  is  tight.  The  example  of  m  points  and  n  Lines  with  Q(m2  3n2  3)  incidences 
can  be  used  to  show  the  same  lower  bound  for  circles  as  follows.  Choose  an  origin  disjoint  from  all 
points  and  Lines  and  perform  an  inversion  with  respect  to  this  origin.  This  transform  maps  every 
point  to  a  point  and  every  line  to  a  circle,  and  it  preserves  incidences.  The  weakness  of  this  lower 
bound  example  is  that  every  circle  contains  the  origin  which  is  an  indication  that  not  all  degrees  of 
freedom  are  properly  used.  Indeed,  in  the  next  section  we  will  see  a  construction  that  gives  a  slightly 
higher  lower  bound  for  a  small  range  of  m. 

(4)  A  stereographic  projection  of  a  sphere  onto  a  plane  maps  a  point  to  a  point  and  a  circle 
to  a  circle  (or  a  Line  if  the  circle  passes  through  the  center  of  the  projection).  This  fact  can  be 
used  to  extend  the  planar  incidence  theorem  to  the  case  where  the  points  and  circles  Lie  on  the 
surface  of  a  sphere  in  three  dimensions.  Consequently,  for  m  points  and  n  circles  on  the  sphere  we 
have  0(  m3' 'n4'5  —  m  —  n  i  as  an  upper  bound  on  the  number  of  incidences  in  the  general  case  and 
0(m:  3n:  3  -  m  -  n)  if  no  three  circles  intersect  in  two  common  points.  The  latter  condition  is 
satisfied  for  example  if  the  circles  are  all  congruent  and  are  not  great-circles  of  the  sphere.  If  the  n 
circles  are  great-circles,  then  the  second  bound  still  holds  since  two  circles  can  intersect  in  only  one 
point  within  any  open  hemisphere. 

(5)  The  planar  incidence  theorem  can  be  restated  to  bound  the  number  of  curves  that  contain 
at  least  some  number  of  points  and  the  number  of  points  contained  in  at  least  some  number  of 
curves:  Given  a  set  of  m  points,  the  maximum  number  of  lines  (pseudolines,  unit-circles  i  containing 
at  least  k  >  2  points  each  is  0(  p —  p).  Symmetrically,  given  n  lines,  pseudolines.  or  unit-circles, 
the  maximum  number  of  points  incident  to  at  least  k  >  2  of  them  is  0(  p-  -  p).  The  corresponding 
bounds  for  circles  and  pseudocodes  are  0(  p —  ^ )  and  0(7777  ~  £  )• 

5.4      Applications  to  Distance  Problems 

As  observed  by  Spencer,  Szemeredi  and  Trotter  [54],  the  maximum  number  of  unit-distance  pairs: 
in  a  set  of  m  points  in  the  plane  is  at  most  half  the  maximum  number  of  incidences  between  m 
points  and  m  unit-circles.  To  see  this  draw  a  unit-circle  around  each  point  and  observe  that  {p.  q} 
is  a  unit-distance  pair  if  and  only  if  p  lies  on  the  circle  around  q  and  q  Lies  on  the  circle  around  p. 
It  follows  that  0(m4/3)  is  an  upper  bound  for  the  maximum  number  of  unit-distance  pairs.  This 
result  is  not  new  but  the  proof  Ln  this  paper  is  considerably  simpler  than  the  one  in  54  and  the 
multiplicative  constant  implied  by  our  proof  is  significantly  smaller  than  theirs.  Below  we  derive 
such  an  improved  constant. 

We  start  with  considering  the  incidence  problem  for  m  points  and  n  unit-circles  in  the  plane. 
Since  no  two  points  are  incident  to  three  unit-circles  (the  corresponding  bipartite  digraph  has  no 


{p.  <j}  u  •»  unit-durance  pair  if  dip.  q)  =  1. 
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A'2.3)  we  get  I(m,n)  <  2i;2mn1'2  +  n  from  remark  (1)  after  the  bipartite  graph  lemma  (Lemma 
4.1).  From  the  remark  after  the  sampling  lemma  (Lemma  5.3)  we  get  23/2m(^)  *  as  an  upper 
bound  for  the  expected  value  of  £l=l  2l'2mlni  "  and  j^-  as  an  upper  bound  for  the  expected  value 
of  3Z T=  l  a'-  ft"  we  ta^e  both  Dounds  times  2  we  can  be  sure  that  there  is  a  sample  for  which  the 
(increased)  bounds  are  true.  So  we  get 

* — ^     1/1        1 /?  Kit     1     n     \i/2       8nr~ 

I(m,n)  <  >   (21/2mtn,/    +  rn)  +  2nr  4-  m  <  2o/2m 4-  4-  2nr  4-  m. 

—  £—*  v  r  -  4  r  -  4 

1=1 

Now  choose  r  =  cm3/3n-1/3  -  c',  with  c  =  7^7  and  4  <  c'  <  5  so  that  r  is  an  integer.  Thus,  we  get 
I{m,  n)  <  8v/5m2/V/3  +■  90n  +  m  +  100^25u4/3m-2/3. 

If  we  set  m  =  a  we  get 

f(m,  n)  <  8v1m4/3  4-  91n  -  100  v^m273. 

Since  the  number  of  unit-distance  pairs  is  at  most  half  this  quantity  we  have  the  following  result. 

Theorem  5.5  The  maximum  number  of  unit-distance  pairs  in  a  set  of  m  points  in  the  plane  is 
4v/5m4/3  -rO(m). 

The  question  of  how  often  a  given  distance  can  occur  in  a  set  of  points  has  also  been  asked  for 
m  points  that  lie  on  a  sphere  in  three  dimensions  (see  Erdos,  Hickerson  and  Pach  [29] ) .  They  use  a 
geometric  transform  to  show  that  the  maximum  number  of  unit-distance  pairs  in  a  set  of  m  points  on 
a  sphere  in  three  dimensions  is  f2(m4/3).  A  matching  upper  bound  follows  from  remark  (4)  after  the 
planar  incidence  theorem  (Theorem  5.4).  The  example  in  [29]  consists  of  pairs  at  distance  exactly 
one  fourth  of  a  great-circle,  and  it  seems  likely  that  this  is  the  only  distance  which  can  occur  fi(m4/3) 
times.  The  best  lower  bound  for  other  distances  is  fi(mlog*  m)  [29). 

Different  Distances  in  the  Plane.  The  remainder  of  this  section  discusses  a  problem  about 
different  distances.  For  a  point  set  P  =  {pi,P2,  •  •  -iPm}  let  gi  be  the  number  of  different  distances 
from  pi,  that  is,  gt  =  \{d(pi,Pj)\l  <  j  <  m,  j  ^  i}\.  We  define  g(P)  =  3ZHi  !7»  an<J- 

g(m)  =  min{<7(P)|P  a  set  of  m  points  in  the  plane}. 

Erdos  [27]  conjectures  g(m)  =  fi(m2(logm)-1//2)  but  proves  only  n(m3/2).  The  upper  bound  on  the 
number  of  incidences  between  m  points  and  n  circles  given  in  i5,  101  implies  g(m)  =  fifm3'3).  We 
improve  this  bound  using  the  planar  incidence  theorem.  Around  each  p,  draw  gx  circles  that  contain 
the  other  points.  This  gives  2(T)  incidences  between  m  points  and  g(P)  circles.  By  the  planar 
incidence  theorem  (ii)  the  number  of  incidences  between  that  many  points  and  circles  is  bounded  by 
O(m3/50(P)4/5)  which  implies  m3/5g(P)4/5  =  n(m2)  for  any  set  P  of  m  points  in  the  plane.  Thus 
we  have  the  following  result. 

Corollary  5.6  g(m)  =  f2(m7/4). 
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As  a  consequence  of  Corollary  5.6,  the  average  gl  and  therefore  also  the  maximum  <?,  is  Q(m3  4). 
always. 

Remarks.  (1)  The  above  lower  bound  on  g\m)  is  new,  but  that  the  maximum  gx  is  fifm3"1)  was 
known  before  (although  only  in  unpublished  form:  see  23- ).  Recently,  Chung,  Szemeredi.  and  Trotter 
claimed  to  have  improved  this  bound  to  fi(m4'5). 

(2)  Let  us  indulge  in  an  instructive  exercise  and  assume  that  we  can  prove  that  the  maximum 
number  of  incidences  between  m  points  and  n  circles  is  0(m2^3n2/3  -m  +  n)  (which  we  cannot).  This 
would  imply  that  m2/3^(m)2/3  =  flfm2)  and  therefore  g(m)  =  fl(m2).  Consequently,  the  maximum 
<7,  would  be  linear  in  m.  This  is  indeed  false  as  Erdos  24]  proves  that  the  number  of  different 
distances  in  the  ^Tn  by  v'7n  grid  is  0(  m(logm)~l/2).  We  conclude  that  the  maximum  number  of 
incidences  between  m  points  and  m:(log  m)~1' 2  circles  is  fi(m2).  In  terms  of  n  =  m:(  log  mri:. 
the  number  of  circles,  the  maximum  number  of  incidences  between  m  =  nl /2(log  n)1  4  points  and 
n  circles  is  fi(rj(log  n)l/2).  This  improves  the  lower  bound  on  the  maximum  number  of  incidences 
between  m  points  and  n  circles  mentioned  in  remark  (3)  after  the  planar  incidence  theorem  for 
n1/2(log  n)1^  <  m  <  n1^2(log  n)3/A.  Of  course,,  the  lower  bound  on  the  number  of  incidences  derived 
in  this  paragraph  implies  the  same  lower  bound  for  the  many  faces  problem  for  circles. 

(3)  Remark  (2)  shows  that  g{m)  =  Q(m2)  if  0(m2/3n2/'3  4-  m  +■  n)  is  an  upper  bound  on  the 
number  of  incidences  between  the  m  points  and  the  g(P)  =  n  circles  whose  centers  are  the  m  points. 
This  bound  holds  if  the  maximum  number  of  circles  intersecting  in  two  common  points  is  at  most 
some  constant  which  is  true  if  the  maximum  number  of  collinear  points  in  P  is  bounded  by  some 
constant.  This  result  was  originally  obtained  by  Szemeredi  in  a  direct  and  elegant  manner  i  see  Erdos 
[27]). 

5.5      The  Complexity  of  Zones 

The  zone  of  a  manifold  in  an  arrangement  is  the  collection  of  cells  of  the  arrangement  crossed  by  the 
manifold.  We  need  estimates  for  the  number  of  faces  bounding  the  cells  of  a  zone,  the  (combinatorial  I 
complexity  of  a  zone,  in  order  to  take  care  of  the  cells  that  spill  outside  of  their  subproblem  funnel. 

In  18]  upper  bounds  on  the  complexity  of  zones  for  rather  general  curves  in  the  plane  are  derived. 
Their  main  theorem  establishes  upper  bounds  on  the  complexity  of  a  zone  in  an  arrangement  of  simple 
curves28  that  are  only  slightly  superlinear  in  the  number  of  curves.  These  bounds  apply  as  long  as 
there  is  some  constant  such  that  any  two  curves  intersect  in  at  most  this  constant  number  of  points. 
We  state  their  result  after  reviewing  a  related  combinatorial  concept. 

The  bounds  on  the  complexity  of  zones  are  given  in  terms  of  maximum  length  functions  of 
so-called  Davenport- Schinzel  sequences.  These  are  sequences  of  symbols  so  that  no  two  adjacent 
symbols  are  the  same  and  there  are  no  long  alternations  of  any  two  symbols.  More  specifically. 
a  sequence  aid*... at,  with  a,  £  {1,2.  ...,n}  for  1  <  t  <  Jb,  is  a  Davenport-Schinzel  sequence  of 
order  s  (for  short  an  (n,  a) -sequence)  if  a,  £  a;+1  for  1  <  i  <  k  -  1  and  there  are  no  s  -  2  indices 
it  <  »2  <  ...  <  i#+2  so  that  aM    =  a,,   =  ...  j£  a,,   =  au   =  ...      Let  A, (a)  be  the  length  of  the 


21  In  this  paper  *(  define  a  itmpie  curve  as  the  image  of  an  interval  or  circle  under  a  continuous  one-to-one  mapping 
to  the  Euclidean  plane.  For  example,  a  pseudoline  is  a  simple  curve  that  is  unbounded  at  both  ends  and  must  therefore 
be  the  image  of  an  open  interval. 
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longest  (n,  s)-sequence.  Then  the  following  bounds  on  \,(n)  are  known,  where  a(n)  is  the  inverse  of 
Ackermann's  function  which  is  notorious  for  growing  extremely  slowly. 

\i(n)  =  n      and      ^(n)  =  2n  —  1, 
A3(n)  =  0(na(n))      (see  [36]), 

A4(n)  =  0(rc.2a<">)      (see  [1]), 
\2l(n)  =  n  ■  2e(^"*"i)      (see  [1]),  and 
\2,+l(n)  =  n.a(nf^n)'-l)      (see  [l]). 

Besides  bounding  the  complexity  of  zones,  bounds  on  the  maximum  length  of  Davenport-Schinzel 
sequences  will  also  be  used  to  analyze  triangulations  of  sphere  arrangements  in  space  (see  Section 
6.3). 

The  theorem  below  summarizes  the  known  results  on  the  complexity  of  zones. 

Theorem  5.7  (zone  theorem).  Let  T  U  {7}  be  a  set  of  n  +  1  simple  curves  in  the  plane  with  the 
property  that  any  two  intersect  in  at  most  3  points  where  they  cross. 

(i)  The  complexity  of  the  zone  of  7  in  A(T)  is  0(A,+2(ra)). 

(ii)  If  T  U  {7}  is  a  set  of  lines  or  pseudolines  then  the  complexity  of  the  zone  of  7  is  0(n). 
(iii)  If  T  is  a  set  of  unit-circles  and  7  is  a  line  or  a  circle,  then  the  zone  of  7  has  complexity  0(  A3(  n)). 

(Parts  (i)  and  (iii)  of  the  above  theorem  are  taken  from  [18].  Part  (ii)  dates  back  to  [8,  21.;  proofs 
can  also  be  found  in  [15,  18].) 

Remark.  Currently,  there  is  only  one  non-trivial  extension  of  the  zone  theorem  known,  namely  for 
hyperplanes  in  d  >  3  dimensions.  The  result  is  that  the  zone  of  a  hyperplane  in  an  arrangement  of 
n  other  hyperplanes  is  bounded  by  at  most  0(nd~l)  faces  of  any  dimension  [21].  The  lack  of  such 
results  for  other  manifolds  is  the  main  reason  for  our  lack  of  non-trivial  bounds  on  the  number  of 
faces  bounding  many  cells  in  three-  and  higher-dimensional  arrangements.  Bounds  for  pianes  and 
hyperplanes  can  be  found  in  [19]. 

5.6     Upper  Bounds  on  the  Complexity  of  Many  Cells 

This  section  derives  our  final  bounds  on  the  maximum  number  of  edges  bounding  m  cells  in  an 
arrangement  of  n  curves.  As  in  earlier  sections  we  consider  lines,  pseudolines,  unit-circles,  (general) 
circles,  and  pseudocodes29.  In  each  case  the  proof  follows  a  modular  structure,  and  to  go  from  one 
case  to  another  only  requires  the  adjustment  of  some  parameters.  As  explained  in  Section  3.  the 
major  steps  of  the  proof  are 


"Recall  that  we  assume  that  a  pseudoline  intersects  any  vertical  line  in  a  point  and  that  a  pseudocode  intersects 
any  vertical  Line  in  at  most  two  points. 
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(1)  the  establishment  of  a  Canham  threshold, 

(2)  the  triangulation  of  the  arrangement  denned  by  a  sample  of  the  curves. 

(3)  the  analysis  of  the  triangulation  by  probabilistic  counting, 

(4)  the  accounting  done  by  divide-and-conquer. 

(5)  the  analysis  of  the  combinatorial  complexity  of  zones,  and 

(6)  the  calculation  of  the  final  bound. 

Steps  (1),  (2),  and  (3)  were  discussed  in  appropriate  detail  in  Sections  4,  5.1,  and  5.2.  To  facilitate 
step  (4)  we  mark  each  cell  by  a  point  somewhere  within  the  cell.  Using  the  appropriate  Canham 
threshold  we  bound  the  number  of  edges  bounding  cells  that  are  marked  within  the  funnels:  this 
covers  all  cells  that  do  not  cross  the  border  between  funnels.  Now  we  add  the  complexity  of  the  inner 
zones  of  the  funnels  to  cover  all  edges  that  were  not  counted  in  step  (4);  these  edges  bound  cells  that 
reach  into  more  than  one  funnel.  Finally,  we  wrap  it  up  by  choosing  the  sample  size  to  optimize  the 
result. 

The  remainder  of  this  section  discusses  all  cases  in  turn  and  finally  summarizes  the  results. 

Lines  and  Pseudolines.  Section  3  proved  K(m.  n)  -  0(m;''37i2''3 ~n)  for  m  cells  in  an  arrangement 
of  n  lines.  Because  of  our  assumption  that  a  pseudoline  does  not  have  any  ri-extreme  points  all 
steps  are  exactly  the  same  as  for  lines.  We  thus  obtain  the  same  bound  for  pseudolines. 

Unit-circles.  The  major  difference  between  the  proofs  for  lines  and  for  unit-circles  is  step  (o).  the 
complexity  of  zones.  To  estimate  the  contribution  of  the  inner  zones  in  the  arrangement  of  sampled 
unit-circles  we  need  bounds  on  the  complexity  of  the  zone  of  a  unit-circle  and  the  zone  of  a  line  in 
an  arrangement  of  n  unit-circles30.  In  both  cases  we  talk  about  curves  so  that  any  two  intersect  in 
at  most  two  points.  By  Theorem  5.7  (iii),  the  number  of  edges  bounding  all  zone  cells  is  Oi  \3tr1)). 
We  can  now  do  the  necessary  calculations  to  obtain  a  good  bound  on  A'(m.  n). 

Let  U  be  a  set  of  n  unit-circles  and  let  R  Z  U  be  a  sample  of  size  r  so  that  31  a  m<a,  = 
0(m(~)1^2)  and  £•  nx  =  O(rn).  Here,  m,  is  the  number  of  points  in  funnel  A,  and  n.  is  the 
number  of  unit-circles  intersecting  A,.  I'sing  part  (iii)  of  the  zone  theorem  we  get 


K{m,n)  =  £o(miflf1/l  +  A3(flf))  =  0(m(  -)W2  -  rna(n)). 


We  choose  r  =  Q{m.2^3n~x^3a(n)  2/3),  which  is  meaningful  as  long  as  m  >  n1  :ain).  With  this 
choice  of  r  we  get 

K{m,n)  =  0(m2/V/3a(n)1'3). 

For  the  case  m  <  n1/2a(n)  we  still  have  Canham  Threshold  4.7  which  gives  us  the  bound  Ol  mn'  :  - 
n)  =  0(m:/3n2/'3a(n)1^3  +  n),  thus  covering  all  cases. 

Circles  and  pseudocodes.  The  arguments  for  pseudocodes  and  circles  are  identical  which  allows 
us  to  ignore  pseudocodes  altogether.  In  contrast  to  unit-circles  we  now  have  to  deal  with  a  weaker 
Canham  threshold  which  bounds  the  number  of  edges  of  m  cells  only  by  0(  mn2  3  -  n),  see  Section 
4.4.   All  the  other  steps  are  the  same  as  for  unit-circles,  except  for  the  complexity  of  a  zone  which 


Since  a  cell  can  reach  from  one  funnel  to  another  oniv  by  intersecting  the  vertical  ude  separating  the  two  funnels 
it  is  actually  sufficient  to  consider  only  zones  of  lines  in  urut-circle  arrangements. 
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is  now  0(A4(r))  and  that  we  need  a  sample  R  of  size  r  so  that  ^A  m.rc,  =  0(m(-)2'3)  and 
Ha,  n'  =  0(rn).  Such  an  R  exists  by  the  results  in  Section  5.2.  We  thus  get 

K(m,n)  =  £  0(mtrI;/3  -  A4(rct))  =  0(m(  -)2/3  +  m^l). 
a,  r  n 

We  define  3e(n)  =  ^^  and  choose  r  =  0(m3/5n-1/5  jc(n)-3/5),  which  is  possible  if  m  >  nl'33c{n). 
This  choice  of  r  leads  to 

A-(m,n)  =  0(m3/5n4/5A3c(n)2/5). 

For  m  <  nl/3,3c(n)  the  Canham  threshold  for  circles  can  be  used  to  get  the  bound  0(mn2/3  fn)  = 
0(m3/5n4/,5^c(n)2/5  +  n)  which  thus  covers  all  cases. 

We  finally  summarize  the  results  of  this  section. 

Theorem  5.8  (planar  many  edges  theorem)  The  maximum  number  of  edges  bounding  a  set  of  m 
cells  in  an  arrangement  A{  N )  of  n  curves  is 

(i)  Q(m2^3n2^3  -f  n)  if  .V  is  a  set  of  lines  or  pseudolines, 
(ii)   0(m2/3n2/3a(n)i<'3  +■  n)  if  N  is  a  set  of  unit-circles,  and 
(iii)  0(m3/5n4/5Jc(n)2/5  +  n)  if  .V  is  a  set  of  circles  or  pseudocodes. 

Remarks.  (1)  For  n1,2  <  m  <  nl/2a(n)  the  bound  in  (ii)  can  be  improved  to  0(mn1/2)  using 
Canham  Threshold  4.7.  Similarly,  for  n1'3  <  m  <  nl''33c(n)  the  bound  in  (iii)  can  be  improved  to 
0(mn2/3)  using  Canham  Threshold  4.6. 

(2)  The  upper  bound  for  unit-circles,  0(m2/,3n2/3a(rc)1//3  —  n),  is  almost  tight  which  can  be  seen 
by  extending  the  fl(m2,3n2^z  +•  n)  lower  bound  for  lines  to  unit-circles:  take  a  line  arrangement  with 
m  cells  realizing  the  maximum  and  approximate  each  line  by  a  large  enough  "unit -circle"  so  that 
no  edges  of  the  m  cells  are  lost.  It  would  be  interesting  to  see  whether  or  not  the  a(n)-factor  is  an 
artifact  of  the  proof  technique. 

(3)  No  matching  lower  bound  for  the  case  of  circles  and  pseudocodes  is  currently  known.  Except 
for  a  small  range  of  values  of  m  relative  to  n  (see  remark  (2)  after  Corollary  5.6)  the  best  known 
lower  bound  is  the  same  as  for  unit-circles. 


6      Spheres  in  Three  Dimensions 

This  section  demonstrates  that  the  techniques  laid  out  in  Sections  4  and  5  can  be  used  to  derive 
bounds  on  the  maximum  number  of  incidences  between  m  points  and  n  spheres  in  three  dimensions 
which  improve  upon  previously  known  bounds.  This  problem  is  interesting  because  various  com- 
binatorial distance  problems  for  points  in  three  dimensions  can  be  rephrased  as  point  and  sphere 
incidence  problems.  This  relationship  will  be  discussed  in  Section  6.5. 
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Observe,  however,  that  we  cannot  expect  a  non-trivial  upper  bound  for  the  general  problem  since 
we  can  have  all  n  spheres  intersecting  in  a  common  circle  and  choose  m  points  on  this  circle.  This 
gives  mn  incidences  which  matches  the  trivial  upper  bound.  We  get  interesting  problems  onlv  after 
imposing  some  restrictions  on  the  spheres  or  points.  Section  6.1  imposes  certain  general  position 
conditions  on  the  spheres  (and/or  points).  This  line  of  investigation  will  lead  to  a  new  upper  bound 
on  the  maximum  number  of  unit-distances  of  a  set  of  n  points  in  three  dimensions.  In  Section  6.2 
we  discuss  the  case  where  the  points  are  required  to  be  vertices  of  the  arrangement  defined  by  the 
spheres.  For  this  case  we  need  to  prove  an  extension  of  the  bipartite  graph  lemma  (Lemma  4.1) 
which  is  of  independent  interest. 

All  steps  of  the  general  proof  scheme  developed  in  Section  5  go  through  without  major  difficulties 
except  for  the  decomposition  of  sphere  arrangements  into  cells  of  constant  description,  the  funnels. 
This  step  is  treated  in  Section  6.3.  The  results  are  summarized  in  Section  6.4  and  applications  to 
combinatorial  distance  problems  in  three  dimensions  are  discussed  in  Section  6.5. 

The  reason  for  concentrating  on  spheres,  rather  than  on  planes,  say,  is  that  our  techniques  do 
not  yield  the  best  bounds  for  point  plane  incidences  and  for  the  complexity  of  cells  in  arrangements 
of  planes.  These  are  best  handled  by  dual  techniques  as  demonstrated  in  the  companion  paper  '19'. 

6.1      Canharn-like  Bounds  under  General  Position  Assumptions 

Assume  that  .V,  the  set  of  n  spheres,  contains  no  three  spheres  that  intersect  in  a  common  circle. 
Three  points  in  three  dimensions  define  a  unique  circle  through  them,  unless  they  are  collinear  in 
which  case  no  sphere  can  contain  all  three.  This  implies  that  any  three  points  of  the  given  set  .1/ 
belong  to  at  most  two  common  spheres.  If  we  consider  Q  =  (MiN.  A)  with  (n.u)  £  A  if  /1  €  v  this 
is  equivalent  to  stating  that  Q  contains  no  A'3,3.  (Observe,  however,  that  Q  can  contain  AS.t  for  t  up 
to  n  and  A',.:  for  3  up  to  m.)  Using  the  bipartite  graph  lemma  we  immediately  get  an  upper  bound 
on  the  number  of  incidences. 

Canham  Threshold  6.1  The  maximum  number  of  incidences  between  m  points  and  n  spheres  in 
three  dimensions  (assuming  no  three  spheres  intersect  in  a  common  circle)  is  0(mn2  3  -  n)  and 
0(nm:''3  +■  m).  v 

Remarks.  (1)  The  first  bound  in  Canham  Threshold  6.1  implies  that  the  number  of  incidences  is 
O(n)  as  long  as  m  <  nl/3. 

(2)  Natural  cases  where  the  assumption  in  Canham  Threshold  6.1  is  satisfied  include  unit-spheres 
and  spheres  with  no  three  collinear  centers.  If  we  relax  the  condition  on  the  spheres  and  require  that 
no  t  spheres  intersect  in  a  common  circle,  for  some  constant  t,  we  still  get  the  first  upper  bound  of 
Canham  Threshold  6.1. 

(3)  Suppose  we  require  that  no  four  points  in  M  are  cocircular  and  drop  any  conditions  on  .V.  In 
this  case  the  graph  Q  that  reflects  the  incidences  contains  no  A'4,;  which  implies  that  the  maximum 
aumber  of  incidences  is 

0(mn4/5  f  n)   and   0(nml/2  -  m). 
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Observe  that  the  next  natural  but  more  restrictive  condition  on  the  points  (no  five  points  cospherical) 
trivializes  the  problem  —  in  this  case,  each  sphere  can  account  for  at  most  four  incidences. 

6.2      Vertices  in  Sphere  Arrangements 

In  this  section  we  assume  that  .V  is  a  set  of  n  spheres  in  three  dimensions  whose  arrangement  has 
at  least  m  vertices,  and  M  is  a  subset  of  size  m  of  the  set  of  vertices.  While  the  ramification  for 
restricting  the  general  position  of  spheres  (as  considered  in  Section  6.1)  consists  of  applications  to 
counting  distances  in  three  dimensions,  the  reason  why  choosing  vertices  only  is  interesting  is  that 
this  problem  comes  close  to  counting  faces  bounding  m  cells.  No  good  bounds  for  this  many  faces 
problem  are  known. 

Before  we  state  and  prove  an  extension  of  the  bipartite  graph  lemma  let  us  understand  why  this 
lemma  is  not  sufficient  to  yield  any  interesting  results  for  the  problem  at  hand.  For  one  thing,  an 
arbitrary  number  of  spheres  can  intersect  in  a  common  circle,  and  up  to  2n  -  4  vertices  can  be 
cocircular31.  It  is  thus  easy  to  construct  fi(mn)  incidences  as  long  as  m  <  n.  This  matches  the 
trivial  upper  bound.  We  see  that  the  problem  is  interesting  only  if  m  is  superlinear  in  n.  As  far  as 
the  bipartite  graph  lemma  is  concerned,  for  every  constant  s  (or  t)  we  cannot  find  a  t  (or  s)  that  is 
sublinear  in  n  so  that  no  s  points  are  incident  to  t  common  spheres.  So  even  the  version  (mentioned 
in  remark  (1)  after  its  proof)  that  treats  3  and  t  as  variables  rather  than  constants  does  not  generate 
any  reasonable  bounds. 

Here  is  why  there  is  still  hope  to  prove  some  non-trivial  bounds  using  a  forbidden  K,,t  argument. 
Suppose  a  sphere  vx  contains  n<  (much  larger  than  n)  of  the  points  in  A/.  Such  a  sphere  must  exist 
unless  the  number  of  incidences  is  only  0(n2).  The  number  of  cocircular  4-tuples  amongst  the  n, 
points  is  at  most  O(nfn)  (because  they  are  vertices  of  the  arrangement)  which  leaves  a  huge  number 
of  non-cocircular  4-tuples.  But  each  such  4-tuple  belongs  to  at  most  one  sphere  and  thus  se%-erely 
limits  the  number  of  possible  incidences. 

We  next  prove  a  generalization  of  the  bipartite  graph  lemma  which  captures  the  idea  indicated 
above  in  combinatorial  terms.  After  that  we  return  to  the  vertex  and  sphere  incidence  problem.  We 
use  •'5-tuple*'  as  a  synonym  for  "set  of  size  3". 

Lemma  6.2  (extended  bipartite  graph  lemma).  Let  Q  =  (M'JN,  A)  be  a  bipartite  digraph  with 
m  -  |A/|  sources  and  n  —  \N\  sinks.  Let  3  and  t  be  constants  and  r  some  number  so  that  for  any 
subset  \I  C  M  there  are  at  least  (\M\  -  r)5 / 3!  3-tuples  in  \I  with  the  property  that  all  3  nodes  in 
every  such  j-tuple  have  at  most  t  -  1  common  adjacent  nodes  in  .V.  Then  \A\  —  0(mnl~l/'  —  m). 

Proof.  As  in  the  proof  of  the  bipartite  graph  lemma  we  consider  an  a-regular  multi-hypergraph 
7i  =  (A/,  H)  —  it  is  constructed  as  follows: 

for  every  u  6  N  adjacent  to  /ii, /X2, . . .,  m»  6  A/  we  add  {n\,pi, . .  .,^,}  to  H  —  unless 


il2n  —  4  cocircular  points  can  be  constructed  as  follows:  take  the  circle  of  intersection  of  two  spheres  and  choose  the 
other  n  -  2  spheres  so  that  each  cuts  the  circle  in  two  unique  points.  No  larger  number  of  cocircular  points  can  be 
constructed  because  each  point  must  lie  in  at  least  three  spheres. 
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the  3  nodes  in  M  are  adjacent  to  more  than  t  -  1  common  nodes  in  N  (in  which  case 
{fi\,  m,  ■  ■  .,n,}  is  not  added  to  H). 

H  contains  at  most  t  -  1  copies  of  every  possible  s-tuple  which  implies    H "  <  (t  -  l)(m). 

Define  a,  as  the  number  of  nodes  in  A/  adjacent  to  v,  £  :V,  for  I  <  i  <  n.    By  assumption.  i/x 
gives  rise  to  at  least  (n,  -  r)s/sl  3-tuples  in  H.  (Of  course,  this  statement  is  void  if  a,  <  r.)   This 

implies 


■m  , 
1=1 

where  31  *  indicates  that  we  sum  only  over  those  indices  i  for  which  n,  >  r.  Using  the  Holder- 
Minkowsky  inequality  we  thus  get 

n  n 

£fii  <  m  +  ^'  (m  -  r)  <  rn  +  (t  -  L)x/*mnl"l/j 

1=1  i=i 

which  impHes  the  assertion  since  \A\  =  £JLj  n,.  — 

We  now  apply  the  extended  bipartite  graph  lemma  to  the  point  and  sphere  incidence  problem 
where  M  is  a  subset  of  m  vertices  of  the  arrangement  denned  by  the  n  spheres  in  .V.  Let  us  consider 
Q  =  [MuN,  A)  with  the  usual  definition  of  A,  and  pick  s  =  4,  t  —  2,  and  r  -  2n  -  4.  We  argue  that 
those  parameters  satisfy  the  assumptions  of  the  lemma,  provided  n  >  4.  Take  anv  subset  A?  of  m 
points.  Any  three  points  in  A/  are  cocircular  with  at  most  2n  -  7  other  points  which  implies  that  the 
number  of  cocircular  4-tuples  in  A/  is  at  most  [T^)^ifJ-.  Consequently,  the  number  of  non-cocircular 
4-tuples  in  M  is  at  least 

(fh\  2n  -  7  _  m(m  -  l)(m  -  2){m  -  In  +  4)        (fh  -  r)A 
[zj       4        ~  4!  "         4!        ' 

This  proves  the  main  result  of  this  section. 

Canham  Threshold  6.3  The  maximum  sum  of  degrees  of  m  vertices31  in  an  arrangement  of  n 
spheres  in  three  dimensions  is  0(mn3/1  +■  n2). 

Remarks.  (1)  We  noted  before  that  the  sum  of  degrees  can  be  quadratic  in  n  if  m.  the  number 
of  vertices,  is  linear  in  n.  Canham  Threshold  6.3  shows  that  the  maximum  sum  does  not  increase 
beyond  0(n2)  as  long  as  m  =  0(n5/4). 

(2)  The  extended  bipartite  graph  lemma  can  be  used  to  prove  that  the  maximum  sum  of  degrees 
of  m  vertices  in  an  arrangement  of  n  (d  -  l)-spheres  in  d  dimensions  is  0(mn1_iTr  -  nJ_1)  using 
the  same  arguments  as  in  three  dimensions. 

(3)  Another  application  of  the  extended  bipartite  graph  lemma  is  to  bound  the  sum  of  degrees 
of  m  vertices  in  an  arrangement  of  n  hyperplanes  in  d  dimensions.  Using  the  same  line  of  arguments 
as  for  spheres  this  gives  0(mn1-I'd  f  nd~l)  as  an  upper  bound.  Consequently,  the  maximum  sum 
of  vertex  degrees  is  Q(nd~l)  for  nd~2  <m<  nd~2*rXI,i.  This  is,  however,  weaker  than  the  bounds  in 
[19]  which  show  that  the  sum  is  Q(nd~x)  for  m  <  nd-3/2/  log  n. 

The  degree  of  a  vertex,  in  this  case,  is  the  number  of  spheres  that  contain  the  vertex. 
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6.3      Decomposition  of  a  Sphere  Arrangement 

Let  .V  be  a  set  of  n  spheres  j,,  1  <  i  <  n,  defining  the  arrangement  A(N).  The  maximum  number 
of  vertices,  edges,  facets,  and  cells  is  cubic  in  the  number  of  spheres33.  Even  if  these  numbers  are 
maximized,  the  average  number  of  vertices,  edges,  and  facets  per  cell  is  constant.  In  contrast  to  the 
average  case,  the  maximum  number  of  vertices,  edges,  and  facets  bounding  a  single  cell  is  quadratic 
in  n.  (The  upper  bound  of  this  claim  can  be  shown  using  a  lifting  transform  that  maps  the  spheres 
to  hyperplanes  in  four  dimensions;  see  [l5j.) 

Our  goal  is  to  ''triangulate''  A(N),  that  is,  decompose  each  cell  of  A(N)  into  subcells,  the  funnels, 
each  with  a  constant  size  description.  The  main  issues  that  arise  are 

(i)  how  to  define  a  funnel  and  a  decomposition  of  -4(:V)  into  funnels,  and 
(ii)  how  to  bound  the  number  of  funnels  that  arise. 

It  goes  without  saying  that  our  objective  is  to  find  a  decomposition  into  as  few  funnels  as  possible. 
The  type  of  funnel  to  be  used  will  be  chosen  accordingly. 

Figure  6.1  sketches  the  type  of  funnel  we  will  use.  Its  combinatorial  structure  is  that  of  a  cube. 
We  refer  to  its  front,  back,  top.  bottom,  left,  and  right  facets  in  the  sense  suggested  by  Figure  6.1. 
The  front  and  back  facets  lie  in  two  planes  normal  to  the  .r^-axis.  The  top  and  bottom  facets  lie  in 
two  hemispheres34  of  the  sx.  In  Figure  6.1  the  top  facet  is  part  of  a  southern  hemisphere  and  the 
bottom  facet  lies  in  a  northern  hemisphere,  but  all  other  combinations  are  allowed.  The  left  and 
right  facets  are  parts  of  two  vertical  elliptic  cylinders35.  The  left  (as  well  as  the  right)  facet  can  be 
part  of  the  left  or  the  right  part  of  such  a  cylinder;  in  Figure  6.1  the  left  facet  lies  in  the  right  part 
of  a  cylinder  and  so  does  the  right  facet. 

Degenerate  cases  where  some  facets  are  empty  or  at  infinity  are  allowed  and,  in  fact,  occur 
frequently  in  the  decomposition  of  A(y)  to  be  described.  Notice,  however,  that  the  top  and  bottom 
facets  of  a  non-empty  funnel  cannot  be  empty  —  if  they  do  not  lie  in  hemispheres  they  are  at  infinity 
and  the  funnel  is  unbounded. 

We  now  give  a  constructive  definition  of  the  decomposition  of  A(N).  The  construction  takes  four 
steps.  We  assume  general  position  of  the  spheres  when  we  describe  the  decomposition. 

Step  1.  From  every  point  of  each  equator  we  draw  a  maximal  vertical  (relatively  open)  line  segment 
that  avoids  all  other  spheres36.  The  union  of  all  line  segments  drawn  from  a  single  equator 
forms  a  portion  of  a  vertical  circular  cylinder. 


"The  exact  bounds  are  2(™)  for  the  vertices,  6(")  for  the  edges,  6(")  +  2n  for  the  2-faces  or  facet],  and  2(")  -  2n 
for  the  ceils. 

4  A  hemisphere  is  the  part  of  a  sphere  that  lies  above  (northern  hemisphere)  or  below  (jouthern  hemisphere  I  the 
horizontal  plane  through  the  sphere's  center.  The  circle  that  separates  the  northern  from  the  southern  hemisphere  is 
the  equator  of  the  sphere. 

"By  this  we  mean  a  cylinder  whose  axis  is  parallel  to  the  xj-axis  and  whose  intersection  with  the  iii2-plane  is  an 
ellipse.  The  tangent  planes  normal  to  the  ii-axis  meet  the  cylinder  in  two  lines  which  partition  it  into  its  left  and  right 
part. 

18  If  there  is  no  sphere  above  or  below  the  point,  then  the  line  segment  extends  to  infinity  and  is  thus  unbounded.  If 
the  pomt  of  the  equator  belongs  to  another  sphere,  then  the  Line  segment  is  empty. 
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Figure  6.1:  A  funnel  is  denned  by  two  planes,  two  spheres,  and  two  elliptic  cylinders. 


After  Step  1  every  cell  (of  the  refinement)  has  the  property  that  it  intersects  every  vertical  line 
in  a  single  (possibly  empty)  interval. 

Step  2.  Perform  the  same  operation  for  all  points  of  each  circle  3i  r\Sj,  i  5=  j,  that  is.  draw  maximal 
vertical  line  segments  that  avoid  all  spheres  except  that  they  intersect  j,  and  3.  in  a  common 
point.  (Figure  6.2.  left,  shows  the  line  segments  extended  vertically  upwards  from  a  circle:  there 
is  a  sphere  that  blocks  the  line  segments  on  the  right  hand  side).  Since  the  vertical  projection 
of  a  circle  in  three  dimensions  onto  the  r^-plane  is  an  ellipse,  the  onion  of  the  line  segments 
coming  from  a  single  circle  forms  a  portion  of  a  vertical  elliptic  cylinder. 

The  nice  effect  of  Step  2  is  that  every  cell  (of  the  refinement)  has  now  a  unique  top  and  a  unique 
bottom  facet,  each  lying  in  a  hemisphere.  However,  the  number  of  vertical  walls  ( facets  that  are  part 
of  vertical  circular  or  elliptic  cylinders)  of  a  single  cell  can  still  be  arbitrarily  large.  Since  each  cell 
intersects  any  vertical  line  in  at  most  one  interval,  the  vertical  projection  of  a  cell  onto  the  r^-plane 
is  a  connected  (but  not  necessarily  simply  connected)  region  whose  boundary  edges  are  the  vertical 
projections  of  the  walls  of  the  cell.  Hence,  the  edges  are  portions  of  circles  and  ellipses  (see  Figure 
6.2,  right). 

Step  3.  Consider  the  vertical  projection  of  a  cell  onto  the  Zix^-plane.  From  the  Zi -extreme3,  points 
of  each  edge  we  draw  a  maximal  vertical  Line  segment  (parallel  to  the  ,r;-axis)  chat  lies  inside 
the  region  (see  the  broken  vertical  line  segments  in  Figure  6.2.  right).  The  corresponding 
operation  in  three  dimensions  is  to  raise  a  wall  vertically  above  the  Line  segment  inside  the  ceLl. 
We  perform  this  operation  for  every  cell  created  in  Steps  1  and  2. 

After  Step  3,  each  cell  (of  the  resulting  refinement)  has  unique  top,  bottom,  front,  and  back 
facets.  Furthermore,  it  intersects  any  plane  normal  to  the  x^axis  in  a  single  (possibly  empty)  simply 
connected  region. 


,TA  point  of  a  circle  or  ellipse  is  ii -extreme  if  the  circle  or  ellipse  lies  on  one  side  of  the  vertical  line  through  the 
point.  The  ii  -extreme  points  oi  an  edge  that  is  subset  of  a  circle  or  ellipse  are  the  x, -extreme  points  of  thr  circle  or 
ellipse  I  if  on  the  edge). 
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Figure  6.2:  Vertical  wails  are  raised  to  decompose  the  cells  into  funnels  as  defined  earlier. 

Step  4.  Finally,  from  each  vertex  of  the  region  (the  vertical  projection  of  a  cell  onto  the  r^-plane) 
draw  a  maximal  vertical  line  segment  inside  the  region  (see  the  dotted  Lines  in  Figure  6.2, 
right).  Again,  the  corresponding  operation  in  three  dimensions  is  to  raise  a  vertical  wall.  This 
is  done  for  all  cells. 


After  Step  4,  each  cell  has  unique  top,  bottom,  front,  back,  left,  and  right  facets;  in  fact,  every 
cell  is  a  funnel  as  defined  earlier. 

The  remainder  of  this  section  bounds  the  maximum  number  of  funnels  created  in  Steps  1  through 
4.  We  will  repeatedly  use  the  fact  that  two  (different)  ellipses  in  the  plane  intersect  in  at  most  four 
points. 

The  main  part  of  the  analysis  estimates  the  increase  in  the  number  of  cells  caused  by  Step  2. 
This  will  be  done  by  considering  all  vertical  walls  raised  from  a  single  circle  of  the  form  $i  n  s,.  The 
analysis  of  Step  1  is  a  special  case  of  that  of  Step  2  and  can  therefore  be  omitted.  Step  3  adds  at  most 
four  walls  per  circle:  the  projection  of  the  circle  onto  the  iir2'Plane-  which  is  an  ellipse,  has  two 
*x -extreme  points  —  from  both  we  send  a  line  segment  upwards  and  one  downwards  (parallel  to  the 
j:2-axis),  and  each  of  the  four  line  segments  in  the  x^-plane  is  lifted  to  a  wall  in  three  dimensions. 
Since  there  are  at  most  (")  such  circles,  Step  3  increases  the  number  of  cells  by  at  most  4(")  =  0(  n2). 
Step  4  gives  rise  to  two  vertical  walls  per  vertical  edge  created  in  Steps  1  and  2,  so  the  number  of 
such  walls  is  proportional  to  the  number  of  these  edges. 

In  order  to  bound  the  increase  in  combinatorial  complexity  caused  by  Step  2,  we  consider  a  circle 
7  =  sx  n  32  and  the  vertical  cylinder,  C7,  which  is  the  union  of  all  vertical  lines  intersecting  7.  For 
3  <  i  <  n,  we  define  Si  =  C7  n  a,.  Ignoring  degenerate  cases,  £,  is  either  a  simple  closed  curve  in  C, 
(Figure  6.3.  left),  it  consists  of  two  simple  closed  curves  separated  by  two  vertical  lines  in  C-,  (Figure 
6.3,  middle),  or  it  consists  of  two  simple  closed  curves  both  meeting  every  vertical  line  in  C,  (Figure 
6.3,  right).  In  any  case,  we  have  the  following  properties. 
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Figure  6.3:  A  sphere  can  intersect  an  elliptic  cylinder  in  three  different  wavs. 
Lemma  6.4  Unless  7  C  <$,,  7  n  Si  consists  of  at  most  two  points,  for  3  <  i  <  n. 

Proof.  Since  7  =  3\f\S2  and  d',  Z  st  every  point  of  y<~S,  is  also  in  3\  njgDjj  (if  fact,  7  ^6,  =  s.  ~ s?  "s, 
but  we  do  not  need  this  to  prove  the  lemma).  Unless  the  three  spheres  intersect  in  a  common  circle 
(in  which  case  7  C  Si),  the  intersection  consists  of  at  most  two  points.  n 

Lemma  6.5    Unless  j,  =  Sj  or  3,  1  j;  C  Cy,  St  H  S}  consists  of  at  most  four  points,  for  3  <  i,  j  <  n. 

Proof.  6,  H  Sj  C  j,  O  Sj  which  is  a  circle  in  three  dimensions.  The  vertical  projection  of  5,  ~  s,  ontc 
the  r^i-plane  is  an  ellipse  and  so  is  the  intersection  of  Cy  with  the  jri*2-plane.  Unless  the  two 
ellipses  are  equal  (in  which  case  s{  0  j;  C  C\)  they  intersect  in  at  most  four  points.  Those  points  are 
the  projections  of  the  points  of  Si  r\S}.  □ 

A  single  S,  has  at  most  four  points  with  vertical  tangents  in  Cy  (the  intersection  points  of  C, 
with  the  equator  of  5,).  We  remove  from  <5,  these  at  most  four  points  as  well  as  the  at  most  two 
points  of  S,  in  7.  This  leaves  us  with  at  most  six  connected  curves  for  each  St.  These  curves  have 
the  following  properties: 

(i)  each  curve  intersects  any  vertical  line  in  C,  in  at  most  one  point, 
(ii)  each  curve  lies  either  fully  above  or  fully  below  7,  and 
(iii)  any  two  curves  intersect  in  at  most  four  points. 


Because  of  property  (i)  we  can  think  of  each  curve  as  a  partially  defined  continuous  function  from  la 
connected  portion  of)  7  to  the  set  of  real  numbers,  where  such  a  function,  with  graph  /.  measures  the 
difference  in  height  of  in  f  and  ^7,  I  a  vertical  line  in  C,  (if/fl/ia  below  7  then  the  difference  is 
negative).  Because  of  property  (ii)  each  function  /  is  either  always  positive  or  always  negative.  The 
lower  envelope  is  the  pointwise  minimum  of  all  positive  functions:  symmetrically,  we  define  the  upper 
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envelope  as  the  pointwise  maximum  of  all  negative  functions.  We  only  need  to  analyze  the  number  of 
pieces38  of  the  lower  envelope  since  the  analysis  of  the  upper  envelope  is  completely  symmetric.  The 
number  of  pieces  of  the  lower  envelope  is  important  since  it  is  the  same  as  the  number  of  breakpoints 
between  adjacent  pieces.  It  is  at  the  breakpoints  where  Step  2  creates  vertical  edges. 

The  number  of  curves,  graphs  of  functions,  above  7  is  at  most  5n  -  9.  five  per  each  s,  plus  one 
for  the  curve  at  infinity.  We  assign  each  curve  a  unique  integer  between  1  and  on  -  9,  and  write 
down  a  sequence  by  taking  the  pieces  of  the  lower  envelope  and  replacing  each  piece  by  the  number 

of  the  curve  containing  it.   Let  (0.1,0.2, a^)  be  the  resulting  sequence.    We  have  a,  =  at_t  since 

the  pieces  are  maximal,  and  there  is  no  alternation  of  length  eight  or  more  between  two  integers  (see 
e.g.  [53]).  It  follows  that  we  have  a  (5n  -  9.  6)-sequence  and  its  length  is  at  most 

A6(5n-9)  =  0(A6(n})  =  n  •  20(a(n)2) 

(see  Section  5.5).  Steps  1  and  2  operate  on  0(n2)  circles  which  implies  that  the  number  of  vertical 
edges  and  facets  created  is  0(n2A6(n)).  As  mentioned  earlier,  the  increase  in  the  number  of  cells 
caused  by  Steps  3  and  4  is  at  most  0(n2)  plus  a  term  that  is  proportional  to  the  number  of  vertical 
edges  created  in  Steps  1  and  2  which  is  0(n2A6(n)).  We  conclude  with  the  main  result  of  this  section. 

Theorem  6.6  (sphere  triangulation  theorem).  The  cells  in  an  arrangement  of  n  spheres  in  three 
dimensions  can  be  decomposed  into  0(n2A6(n))  funnels. 

Remarks.  (1)  The  decomposition  of  a  sphere  arrangement  described  in  this  section  is  not  a  cell 
complex  since  a  facet  of  one  funnel  can  contain  an  edge  (or  part  thereof)  of  another  funnel.  This  is. 
however,  irrelevant  for  our  use  of  such  decompositions. 

(2)  We  do  not  know  whether  or  not  0(n2A6(n))  is  tight  for  the  decomposition  described.  In  fact. 
we  do  not  even  have  an  example  where  the  number  of  funnels  is  more  than  cubic  in  n.  This  suggests 
two  open  problems.  First,  give  the  correct  asymptotic  order  of  the  maximum  number  of  funnels 
constructed  by  the  above  algorithm.  Second,  if  that  bound  fails  to  be  cubic,  hnd  a  decomposition 
into  0(n3)  funnels,  possibly  with  a  different  definition  of  "funnel"  than  used  in  this  section. 

(3)  We  believe  that  the  above  decomposition  can  be  generalized  to  apply  to  any  collection  of  n 
algebraic  surfaces  of  fixed  maximum  degree  in  three  dimensions,  leading  to  0(n2X,(n))  appropriately- 
defined  funnels,  where  s  depends  on  the  type  of  surfaces  and  their  maximum  degree.  This  result  is 
significant  as  we  can  compare  it  to  the  standard  Collins'  cylindrical  algebraic  decomposition  tech- 
niques 13,  52,  2].  Collins'  technique  also  produces  a  decomposition  of  an  arrangement  of  n  such 
algebraic  surfaces  into  cells  of  constant  description,  but  the  number  of  such  cells  is  enormous  — 
doubly-exponential  in  the  number  of  dimensions.  For  example,  in  three  dimensions  the  number 
of  Collins  cells  that  are  produced  in  general  is  O(n')  (assuming  constant  algebraic  degree  of  the 
surfaces). 

(4)  A  challenging  open  problem  is  to  generalize  the  sphere  triangulation  theorem  to  arrangements 
of  algebraic  surfaces  in  higher  dimensions,  or  even  just  to  arrangements  of  spheres  in  four  or  higher 


Here  we  define  a  ptece  as  a  maximal  connected  subset  of  the  graph  of  the  lower  envelope  that  belongs  to  a  single 
curve.  In  order  to  avoid  intervals  over  which  the  lower  envelope  is  not  defined  we  add  a  function  whose  graph  is  at 
infinity  —  it  can  also  contain  pieces  of  the  envelope. 
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dimensions.  An  equally  challenging  problem  is  to  obtain  a  decomposition  of  anv  collection  C  of 
cells  in  an  arrangement  of  spheres  or  other  surfaces  in  three  dimensions,  into  funnels  of  constant 
description,  whose  total  number  is  roughly  proportional  to  the  complexity  of  the  cells  in  C  plus,  say 
a  quadratic  of  slightly  superquadratic  overhead  term.  Some  initial  investigations  of  these  problems 
in  the  polyhedral  case  are  reported  in   9.  3  . 

(5)  Section  6.4  will  use  the  decomposition  technique  presented  in  this  section  also  for  degenerate 
arrangements  of  spheres.  As  explained  in  Section  3  for  the  case  of  lines,  our  policy  will  be  to 
conceptually  perturb  the  spheres  so  that  they  form  a  non-degenerate  arrangement  and  to  triangulate 
this  arrangement.  The  resulting  triangulation  corresponds  to  a  triangulation  of  the  unperturbed 
arrangement  which  contains  many  zero-measure  funnels. 

(6)  The  triangulation  of  a  sphere  arrangement  as  defined  in  this  section  satisfies  the  independence 
lemma  (Lemma  5.1)  with  the  size  of  sets  R'  bounded  from  above  by  six.  Based  on  this  observation 
we  can  now  extend  the  sampling  lemma  (Lemma  5.3)  to  the  case  of  spheres.  This  can  indeed  be 
done  without  any  difticulty,  except  for  one  step  namely  Lemma  5.2  (ii)  which  is  necessary  to  bound 
the  sum  of  the  n,.  This  step  can  be  replaced  by  a  tail  estimation  as  described  in  Clarkson  12  which 
then  shows  that  for  every  6  <  r  <  n  there  exists  a  sample  of  size  r  so  that  the  average  number 
of  spheres  intersecting  any  one  funnel  is  O(nlr).  An  even  stronger  result,  that  the  expected  value 
of  the  square  of  the  number  of  such  spheres  is  0((n/r)2),  can  also  be  derived.  We  refer  to  12'  for 
details.  The  extension  of  the  sampling  lemma  to  spheres  will  be  used  in  the  next  section. 

6.4      General  Bounds  on  the  Number  of  Incidences 

LTsing  the  results  of  Sections  6.1  through  6.3  we  can  now  derive  improved  upper  bounds  on  the 
maximum  number  of  incidences  between  m  points  and  n  spheres  in  three  dimensions.  We  do  this 
for  the  cases  where  no  three  spheres  intersect  in  a  common  circle  and  where  the  points  are  vertices 
of  the  arrangement  defined  by  the  spheres.  In  both  cases,  the  analysis  starts  with  a  random  sample 
of  r  of  the  spheres  and  with  a  decomposition  of  the  cells  in  the  arrangement  as  described  in  Section 
6.3.  The  sampling  lemma  in  Section  5.2  implies  that  there  exists  such  a  sample  with  the  desired 
properties  detailed  there. 

Spheres  in  General  Position.  As  mentioned  before,  we  choose  r  of  the  n  spheres  and  triangulate 

the  arrangement  defined  by  the  r  spheres.     This  is  a  decomposition  of  the  cells  into  a  total  of 

k  =  0(r2,\6(r))  funnels39.  We  number  the  funnels  from  1  through  k  and  let  m,  denote  the  number 

of  points  contained  in  the  ith  funnel  and  n,  be  the  number  of  spheres  intersecting  this  funnel"10.   If 

I{m,n)  is  the  maximum  number  of  incidences  between  m  points  and  n  spheres  in  general  position. 

then 

k 

I(m,  n)  <^2  /(m,,  nx)  -  /(m,  n,  r), 

1=1 

where  J(m,  n,  r)  is  the  maximum  number  of  incidences  between  m  points  and  n  spheres  under  the 
restriction  that  the  m  points  lie  on  r  spheres.  The  second  term  of  the  right  side  bounds  the  number 


"Recall  that  a  funnel  is  an  open  set. 
Since  we  can  rotate  the  configuration  of  spheres  and  points  we  can  assume  that  a  point  lies  on  the  boundary  of  a 
funnel  only  if  it  belongs  to  at  least  one  sample  sphere. 
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of  incidences  that  occur  on  the  sample  spheres,  the  first  term  takes  care  of  ail  other  incidences. 

An  upper  bound  on  T^1=l  /(m,,  n,)  can  be  obtained  using  the  first  bound  in  Canham  Threshold 
6.1  and  the  sampling  lemma  for  spheres  (see  remark  (6)  at  the  end  of  Section  6.3).  In  particular,  this 
lemma  implies  that  there  is  a  sample  of  r  spheres  so  that  the  average  rc,  is  0(n/r)  and  J)jL1  rntn~'  3  = 
0(m(rc/>)2/3).  We  thus  get 

£/(m„  m)  =  V  0(mxrc2/3  +  rc,)  =  0(m(-)2/3  +  *-)  =  0{mn^r^3  4-  r\6(r)n). 

The  two  terms  of  the  final  bound  balance  at  r  =  0(ro3/8rc~l/,8(  *  mp — )  )  (which  can  be  shown 
using  the  extremely  slow  growth  of  -^A)  in  which  case  we  have 

k 

£/(m„rc,)  =  0(m3/4rc3/4/3,(m,rc)1/4), 
i=i 

\  i  m3  i 
where  we  define  3,(m,  n)  =    "^    .  (This  choice  of  r  makes  sense  as  long  as  m  =  H(rc1/3);  see  below 

for  the  other  case.) 

In  order  to  bound  J(m,  n,r)  we  index  the  r  sample  spheres  from  1  through  r  and  let  m;  denote 
the  number  of  points  on  the  jth  sample  sphere.  The  sum  of  the  m_,  can  exceed  m  since  one  point 
can  lie  on  several  sample  spheres.  Nevertheless  we  have 

r 

]jr  rhj  <  I(m,  r)  =  0(rm2/3  +  m) 
using  the  second  bound  of  Canham  Threshold  6.1.  Plugging  in  the  value  of  r  as  chosen  above  gives 

r 

£  m;  =  0(m25/24n-l/8/3J(m,  np3/8  +  m)  =  0(m) 
]=i 

if  m  <  rc3,  which  we  can  assume  since  any  point  that  is  not  a  vertex  of  the  arrangement  of  spheres  is 
incident  to  at  most  two  spheres  by  assumption.  The  number  of  incidences  counted  by  J(m.  n,  r)  can 
now  be  bounded  by  considering  the  r  sample  spheres  in  turn.  The  jth  sample  sphere  contains  m; 
points  and  intersects  the  other  spheres  in  less  than  n  circles  all  of  which  are  distinct  by  the  general 
position  assumption.  Thus,  the  problem  becomes  an  incidence  counting  problem  for  m;  points  and 
n  circles  on  a  sphere.  For  this  we  have  an  upper  bound  of  the  form 

0(m3/5n4/5  +  m;  +.  n), 

see  the  remark  (4)  after  the  planar  incidence  theorem  (Theorem  5.4).  Therefore 

r 

7(m,n,r)  =  0(  V(m3/V/5  +  fh:  +  rc))  =  0(m3/5rc4/5r2/5  +  m  +  rrc). 
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After  plugging  in  the  above  value  of  r  we  get 

Jim.  re,  r)  =  0(m3/4n3/43,(m>  n)'3'20  -  m  -  m*f9n7'*0.{m, n)~3'8)  =  0(m3'V/4  -  m) 

as  long  as  m  =  ft(n1/3).  Finally,  for  m  =  0(n1,3)  Canham  Threshold  6.L  implies  that  I(m,n)  =  0(n). 
This  completes  the  analysis  of  incidences  for  spheres  with  no  thre«  intersecting  in  a  common  circle. 
The  result  is  summarized  in  the  following  theorem  which  improves  the  bounds  given  in  Canham 
Threshold  6.1. 

Theorem  6.7  The  maximum  number  of  incidences  between  m  points  and  n  spheres  in  three  dimen- 
sions (assuming  that  no  three  intersect  in  a  common  circle)  is  0(m3/4n3/4  j,(  m.  rc)1'4  -  m  -  n). 

Remarks.  ( 1 )  0(  n3  4-  m)  is  a  trivial  upper  bound  on  the  number  of  incidences  between  m  points  and 
n  spheres  no  three  of  which  intersect  in  a  common  circle.  The  bound  given  in  Theorem  6.7  exceeds 
this  bound  for  a3J,(m,  rc)-1/3  <  m  <  n3j,(m,  n).  In  view  of  the  fact  that  3,{m,n)  is  a  small 
constant  unless  m  is  unreasonably  large41,  the  bound  in  Theorem  6.7  is  not  far  off  the  insignificantly 
more  refined  bound 

0(  min{m3/4n3/4jJ(m,  n)1/4,   n3}  -  m  +  n). 

We  view  this  as  a  strong  indication  that  the  3s(m,  n)-term  is  an  artifact  of  our  analysis.  Indeed,  it 
is  inherited  from  the  analysis  of  the  triangulation  method  given  in  Section  6.3. 

(2)  It  is  not  likely  that  the  bound  in  Theorem  6.7  turns  out  to  be  tight,  even  if  we  ignore  the 
J,(m,  ri)-term.  A  lower  bound  construction  which  achieves  ft(n4/3  log  log  re)  incidences  for  m  =  n 
can  be  found  in  [25]  (see  also  Section  6.5  below). 

(3)  The  analysis  used  to  derive  Theorem  6.7  can  also  be  used  to  improve  upon  the  bound  on  the 
maximum  number  of  incidences  between  m  points  and  n  spheres  where  we  assume  that  no  four  of 
the  points  are  cocircular  (see  remark  (3)  after  Canham  Threshold  6.1).  The  bounds  that  we  obtain 

are  0(m5/7rc6/T.3r(m,  n)2/7  +  m  +  n)  and  0(nm1/2  +  m),  where  Jr(m,  re)  =  M  f  }.  The  first  bound 

is  a  minor  improvement  of  the  bound  obtained  straight  from  the  bipartite  graph  lemma  (  Lemma  4. 1 ) 
which  is  0(mn4/5  +  a),  see  remark  (3)  after  Canham  Threshold  6.1.  It  seems  Likely  that  a  "dual" 
approach  as  described  in  [19]  will  yield  a  better  upper  bound  than  this. 

(4)  Theorem  6.7  implies  that,  given  a  set  of  m  points,  the  maximum  number  of  spheres  that 
contain  at  least  k  >  2  points  each  is  Of^t^m)  +  £),  where  J  is  a  generic  and  extremely  slowly 
growing  function.  Symmetrically,  the  maximum  number  of  points  that  Lie  on  at  least  k  >  2  spheres 
each  is  0(  fr,j(n)  ■+■  J).  Of  course,  here  we  talk  only  of  spheres  such  that  no  three  intersect  in  a 
common  circle. 

(5)  The  method  in  Chung  [10]  can  be  used  to  extend  the  bound  in  Theorem  6.7  to  four  and 
higher  dimensions  (using  our  three-dimensional  bound  as  the  base  case).  The  result  is  then  that  the 
maximum  number  of  incidences  between  m  points  and  n  (d  -  1  (-spheres  in  d  dimensions  is 

i  d 

0(m4*'  n**1  J(m,  n)  ~  m  +■  n) 


"We  mean  really  unreasonable,  that  13.  lar^e  so  that  the  number  of  digits  needed  to  write  m  exceeds  the  number  or 
electrons  in  the  universe. 
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if  no  d  of  our  (d  -  l)-spheres  intersect  in  a  common  L-sphere  (a  circle).  This  improves  the  original 
bound  in  [lOi  which  is  0(m(li+1)/^2,n('i+1)/(<J+2>  +  mn^-21^-"  4-  n).  It  is  interesting  to  compare 
this  with  the  "in-between"  bound  0(m^d-\)/(2d~i)nr.d)/(2d+\)  +  m  _  n)  wnich  we  obtain  if  we  base 
the  approach  in  [10]  on  our  bound  for  points  and  circles. 

Vertices  in  Sphere  Arrangements.    The  analysis  of  this  case  is  similar  to  that  of  spheres  in 

general  position,  the  main  difference  being  that  we  now  use  Canham  Threshold  6.3  in  place  of 

Canham  Threshold  6.1  to  bound  the  number  of  incidences  in  each  of  the  k  =  0(r2,\6(r))  funnels. 

Again  we  write  I(m,n)  for  the  maximum  number  of  incidences  between  m  points  and  n  spheres, 

where  now  the  points  are  assumed  to  be  vertices  of  the  arrangement.  Furthermore,  we  write  m,  for 

the  number  of  points  in  the  ith  funnel,  n,  for  the  number  of  spheres  intersecting  the  ith  funnel,  and 

J(m,  n,  r)  for  the  maximum  number  of  incidences  if  the  m  points  are  restricted  to  lie  on  r  spheres. 

Thus,  we  have 

h 

I(m,n)  <  )    I(mj, n,)  +  J(m,n,r). 

i=i 

Using  Canham  Threshold  6.3  and  the  sampling  lemma  for  spheres  (see  above)  we  bound  the  first 
term  on  the  right  side  as  follows. 


V/fm.n,)  =  V  0(mtn3/4  4-  n,2)  =  0(m(-)3/4  4-  fc(-)2)  =  0(m^V^  +  n2A6(r)). 
—  . — .  r  j. 


m* 


The  two  terms  in  the  last  bound  balance  if  r  =  0(m4/'n"5/7f3„(m,  n)-4/'7),  where  J„(m.  n)  =     *[mf    . 

~^~ 
In  this  case  we  have 

k 

£/(mt,nt-)  =  0(m4/7n9/7/3v(m,n)3/r). 


1=1 


In  order  to  bound  J(m,n,r)  we  index  the  r  spheres  from  1  through  r  and  define  m:  as  the  number 
of  points  on  the  j*    sphere.  We  have 

r 

£]  fhj  <  I(m,  r)  =  0(r3  4-  m). 
With  the  choice  of  r  as  above  we  get 

r 

£  m:  =  0(m12/7n-15/7/3„(m,  n)"l2/7  4-  m)  =  0(m) 

as  long  as  m  =  0(n3)  which  holds  since  any  arrangement  of  n  spheres  has  at  most  2Q)  vertices. 
J(m,  n,  r)  can  now  be  bounded  by  considering  the  circles  on  the  sample  spheres  which  are  the 
intersections  with  the  other  spheres. 

r 

J[m,  n,  r)  =  0(  £](m3/V/5  +  %  +  n))  =  0(m3/5n4/5r2/5  +  m  -  rn)  = 

3=1 

=  0(m29/35n18/35^v(m,M)-6/35  4-  m  +  m4'7 V''7 /3,(m,  a)"4/7)  =  0(m4'V7  4-  m) 
as  long  as  m  —  0(n3).  This  completes  the  analysis  for  m  vertices  in  an  arrangement  of  n  spheres. 
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Theorem  6.8  The  maximum  sum  of  degrees  of  m  vertices  in  an  arrangement  of  n  spheres  in  three 
dimensions  is  0(m4/7n9/7|3„(m.  n)3/7  +  n2). 

Remarks.  (1)  Clearly  the  sum  of  degrees  cannot  exceed  Q(mn)  and  0(n3);  the  first  bound  improves 
Theorem  6.8  if  m  =  0(n),  the  second  bound  does  so  if  m  =  fi(n3jv(m.  ra)~3/4).  A  refinement  of  the 
bound  in  Theorem  6.8  is  therefore 

0(min{mn.  m4/7n9/7Ju(m,  n)3/7,  n3}-min{mn.  n2}). 

While  the  improvement  for  small  m  is  substantial  but  trivial,  the  improvement  for  large  m  is  insignif- 
icant and  due  to  an  artifact  of  our  analysis  which  gives  rise  to  the  nearly  constant  3v{m.  n)-factor  in 
the  bound. 

(2)  It  is  likely  that  the  bound  in  Theorem  6.8  is  not  tight,  even  if  we  ignore  the  J„(m,  rc)-factor. 
A  further  improvement  hinges  on  an  improvement  of  Canham  Threshold  6.3  —  using  the  techniques 
of  this  paper,  an  improvement  of  Theorem  6.3  would  immediately  follow. 

(3)  By  Iheorem  6.8  the  maximum  number  of  vertices  in  an  arrangement  of  n  spheres  that  have 
degree  k  or  higher  is  0(  j-775-  ,j(n)  4-  2L). 

6.5      Applications  to  Distance  Problems 

Theorem  6.7  can  be  used  to  improve  known  upper  bounds  for  various  distance  problems  in  three 
dimensions.  We  briefly  discuss  the  history  of  two  such  problems,  state  our  improvements,  and 
remark  on  additional  distance  problems.  In  each  case,  the  reduction  to  Theorem  6.7  is  done  bv 
drawing  spheres  of  appropriate  radii  around  the  points  so  that  the  counted  distances  correspond  to 
incidences. 

Unit-distance  Pairs.  This  problem  was  originally  posed  by  Erd5s  who  reports  initial  results  in  24 
and  25i.  The  problem  is  to  bound  the  maximum  number  of  unit-distance  pairs42  in  a  set  of  m  points 
in  three  dimensions.  "25]  proves  that  fi(m4/3loglog  m)  is  a  lower  bound  and  0(m5  3)  is  an  upper 
bound  for  this  problem.  This  was  subsequently  improved  by  Beck  [5]  to  0(ml3/8^e),  :  >  0.  and  bv 
Chung  10]  to  0(m8/s).  All  three  upper  bound  proofs  draw  a  unit-sphere  around  each  point  and  base 
the  argument  on  the  fact  that  {p,  q}  is  a  unit -distance  pair  if  and  only  if  p  lies  on  the  unit-sphere 
around  q  which  is  equivalent  to  q  lying  on  the  unit-sphere  around  p.  The  number  of  incidences  for 
the  m  points  and  the  m  unit-spheres  is  thus  twice  the  number  of  unit-distance  pairs.  Theorem  6.7 
applies  since  no  three  equally  large  spheres  in  three  dimensions  intersect  in  a  common  circle. 

Corollary  6.9  (unit-distance  theorem).  The  maximum  number  of  unit-distance  pairs  in  a  set  of  m 
points  in  three  dimensions  is  0(m3/:(   ,<fT")  '    ). 

Remark.  ( 1 )  Without  fear  of  redundancy  we  point  out  that  the  multiplicative  factor,  (  m< )  is 
a  small  constant  unless  m  is  unreasonably  large.  It  is  probably  an  artifact  of  our  proof  technique, 
more  particularly  of  the  tnangulation  of  a  sphere  arrangement  described  in  Section  6.3. 

A  unit-distance  pair  consist  of  two  point]  that  are  one  mut  of  distance  apart.  Since  any  point  set  can  be  increased 
or  decreased  bv  scaling,  one  unit  is  just  any  riied  positive  distance.  Thus,  the  problem  is  really  to  bound  the  maximum 
number  of  times  the  most  popular  distance  can  occur. 
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(2)  It  is  interesting  to  observe  that  the  method  of  Chung  [10]  which  improves  ErdoV  initial  upper 
bound  to  0(m3/5)  can  be  further  improved  to  0(mu/')  just  by  using  the  planar  incidence  theorem 
for  points  and  circles  in  the  plane  (Theorem  5.4  (ii)).  This  bound  is  still  inferior  to  the  bound  in  the 
unit-distance  theorem  (see  also  remark  (5)  after  Theorem  6.7). 

Furthest  Neighbor  Pairs.  The  problem  here  is  to  bound  the  maximum  number  of  furthest 
neighbor  pairs43  in  a  set  of  m  points  in  three  dimensions.  It  is  easy  to  see  that  the  maximum  number 
of  furthest  neighbor  pairs  in  a  set  of  m  points  in  three  dimensions  is  fi(m-);  the  constant  in  front 
of  the  m2  and  lower  order  terms  can  be  found  in  [4].  As  observed  by  Edelsbrunner  and  Skiena  22!, 
the  worst  case  cannot  be  realized  if  no  three  points  are  collinear;  for  this  case  they  prove  0(mD/3)  as 
an  upper  bound.  This  was  improved  to  0(md/5)  by  Chung  [10].  The  relation  to  incidence  counting 
becomes  clear  when  we  draw  around  each  point  a  sphere  that  goes  through  all  furthest  neighbors  of 
the  point.  Now,  (p,  q)  is  a  furthest  neighbor  pair  if  and  only  if  q  lies  on  the  sphere  around  p.  No  three 
of  the  m  spheres  intersect  in  a  common  circle  since  their  centers  are  not  collinear  by  assumption. 
Using  Theorem  6.7  we  thus  get  the  following  result. 

Corollary  6.10  (furthest  neighbor  theorem).  The  maximum  number  of  furthest  neighbor  pairs  in 
a  set  of  m  points  in  three  dimensions  (no  three  of  which  are  collinear)  is  0(ro3^2(   6(m))      ). 

Remarks.  ( 1)  In  contrast  to  the  unit-distance  problem,  no  superlinear  lower  bound  on  the  maximum 
number  of  furthest  neighbor  pairs  is  known.  A  related  result  is  that  the  maximum  number  of  diameter 
pairs44  in  a  set  of  m  points  in  three  dimensions  is  2m  -  2  (see  [38],  [33],  [55]).  This  bound  also  holds 
for  the  maximum  number  of  symmetric  furthest  neighbor  pairs45.  One  can  take  these  results  as 
indications  that  the  maximum  number  of  furthest  neighbor  points  is  also  linear  in  the  number  of 
points  but  no  proof  is  known. 

(2)  Of  course,  Theorem  6.7  can  also  be  used  to  bound  the  maximum  number  of  minimum  distance 
pairs46  in  three  dimensions,  but  a  better  (linear)  upper  bound  can  be  proven.  This  proof  (which  uses 
a  packing  argument)  cannot  be  extended  to  the  case  where  each  point  is  colored  either  red  or  blue 
and  only  distances  between  differently  colored  points  are  considered.  Theorem  6.7  still  applies  and 

shows  that  0(m3/2(  -t~)  )  is  also  an  upper  bound  for  the  bichromatic  version  of  the  minimum 
distance  problems.  It  would  be  desirable  to  find  a  superlinear  lower  bound  if  there  is  one.  (This 
problem  was  also  proposed  by  Janos  Pach  at  the  Second  Computational  Geometry  Day  in  New 
York,  1986.)  In  the  two-dimensional  case,  the  maximum  number  of  minimum  bichromatic  distances 
is  0(m)  because  the  graph  denned  by  minimum  distance  pairs  is  planar  (see  [291). 

(3)  For  the  bichromatic  maximum  distance  problem  the  proof  for  the  monochromatic  case  (see 
remark  (1)  above)  can  be  extended  to  show  a  linear  upper  bound  as  follows.  Around  each  red  point 
we  draw  a  closed  ball  with  radius  equal  to  the  maximum  bichromatic  distance.  Call  this  distance  6. 
For  m  >  3  the  intersection  of  the  m  balls  has  the  structure  of  a  convex  polytope  with  at  most  m 
(spherical)  facets,  at  most  3m  -  6  edges,  and  at  most  2m  -  4  vertices.  The  sum  of  all  vertex  degrees 
is  therefore  at  most  6m  -  12.  By  construction,  this  polytope  contains  all  blue  points.  If  a  red  point 


4   A  directed  pair  (p,  q)  is  a  furthest  neighbor  pair  of  a  Unite  point  set  P  if  dip,  q)  =  max{d(p,  z)\x  6  P}. 
44  (p,  q}  is  a  diameter  pair  of  a  set  P  if  d(p,  q )  ss  majc{d(i,  y)|r,y  €  P}. 

45 {p,  q}  is  a  symmetric  furthest  neighbor  pair  d(p,q)  and  iq,p)  are  furthest  neighbor  pairs, 
{p,  q)  is  a  minimum  distance  pair  of  a  set  P  if  dip,  q)  =  min{<i(  X,  y)\z,u  £  P}. 
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does  not  generate  a  facet  of  the  polytope  then  the  sphere  that  bounds  the  bail  of  the  point  can  touch 
the  polytope  in  at  most  one  point.  It  follows  that  the  red  point  can  realize  the  maximum  distance 
to  at  most  one  blue  point.  We  count  the  other  maximum  distance  pairs  for  the  blue  points.  If  a  blue 
point  Lies  on  a  facet  it  has  one  red  point  at  distance  6,  if  it  lies  on  an  edge  it  has  two  red  points  at 
distance  6,  and  if  it  is  a  vertex  of  the  polytope  then  the  number  of  such  red  points  is  equal  to  the 
degTee  of  the  vertex  —  in  the  latter  two  cases  we  ignored  red  points  that  do  not  generate  facets. 
Blue  points  in  the  interior  of  the  polytope  have  no  red  point  at  distance  8  which  implies  the  claimed 
linear  bound. 

(4)  As  in  Section  5.4,  we  can  use  upper  bounds  on  the  number  of  incidences  to  derive  lower 
bounds  on  the  number  of  different  distances  denned  by  m  points.  Let  pi.p^, . .  .  ,pm  be  the  m  points 
of  set  P  and  let  g,  be  the  number  of  different  distances  from  point  p,,  that  is,  gt  is  the  cardinality 
of  {d(p„p:)\l  <  j  <  n,  j  j4  t}.  Around  each  p,  we  draw  g,  spheres  through  the  other  points.  Thus. 
we  have  m  points,  n  =  £Jlt  <7«  spheres,  and  2(7)  incidences.  If  we  assume  that  no  three  points  are 
collinear  we  can  use  Theorem  6.7  and  get 


which  implies  that 


2H  =  0(m3«n3'*j3,(m,n)1'*) 


m 


In  words,  the  average  (and  therefore  also  the  maximum)  gt  is  fi(m:/3(  *'  )  ).  Strangely  enough, 
the  m  points  arranged  in  a  cubic  grid  realize  0(m2/3)  different  distances  each,  but  they  violate  the 
collinearitv  restriction  under  which  the  lower  bound  is  obtained. 


7     Discussion  and  Open  Problems 

The  main  contribution  of  this  paper  is  a  uniform  and  modular  approach  to  a  variety  of  combinatorial 
incidence  and  many-faces  problems.  Some  of  the  bounds  obtained  with  this  approach  are  tight  (for 
example,  the  bounds  for  the  incidence  and  many-faces  problems  for  Lines  and  pseudolines),  some  are 
tight  up  to  a  J(rc)-factor  (the  bound  for  the  many-faces  problem  for  unit-circles),  and  some  still  leave 
large  gaps  to  be  closed.  Bounds  on  incidence  problems  can  be  used  to  get  bounds  on  combinatorial 
distance  problems,  such  as  the  problem  to  count  the  maximum  number  of  unit-distance  pairs  in  a 
set  of  n  points  in  three  dimensions.  For  this  and  for  other  distance  problems  we  obtain  improved 
bounds,  but  except  for  the  unit-distance  problem  for  points  on  a  sphere  in  three  dimensions,  no  such 
bound  seems  to  be  tight. 

Narrowing  or  closing  the  gaps  that  the  approach  of  this  paper  leaves  is  a  challenging  open  prob- 
lem (see  Section  2  for  a  summary  of  the  results  of  this  paper).  More  specialized  open  problem 
are  mentioned  throughout  the  paper  and  we  hope  that  some  will  be  the  starting  point  for  future 
developments  in  this  area. 

The  reader  of  this  paper  might  wonder  why  this  paper  makes  no  attempt  to  solve  incidence 
and  many-faces  problems  that  seem  closely  related  to  problems  successfully  tackled  in  this  paper. 
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Examples  are  the  incidence  and  many-faces  problems  for  planes  and  hyperplanes.  The  generic  answer 
is  that  some  step  of  our  modular  proof  technique  breaks  down  or  gives  results  that  are  too  weak 
for  any  new  bounds.  Below,  we  list  more  specific  reasons  why  our  method  fails  for  a  few  interesting 
cases. 

(1)  The  Canham  threshold  obtained  for  m  vertices  in  an  arrangement  of  n  (hyper-)  planes  (using 
the  extended  bipartite  graph  lemma  (Lemma  6.2))  is  weaker  than  the  bounds  derived  in  [19' 
using  a  different  approach  altogether. 

(2)  The  bipartite  graph  lemmas  (Lemmas  4.1  and  6.2)  seem  to  be  of  little  use  for  establishing 
a  Canham  threshold  for  many-faces  in  a  line  segment  arrangement.  Again,  a  quite  different 
approach  (see  [20] )  leads  to  bounds  that  are  almost  tight. 

(3)  The  bounds  on  the  incidence  problem  for  spheres  in  three  dimensions  fail  to  generalize  to  the 
corresponding  many-faces  problem  because  of  the  lack  of  a  zone  result  for  spheres. 

(4)  The  major  obstacle  in  generalizing  the  incidence  bound  for  three-dimensional  spheres  to  four 
and  higher  dimensions  is  the  lack  of  appropriate  triangulation  results. 
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